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New cyclic memories, e.g», Charge-Goupled~Devices 
(CCDs) and Magnetic Babbles, fill the wide gap of price 
and performance between semiconductor/core RAM, and 
magnetic disks and drums. We call these memories 
Electronic Cyclic Memories (ECMs) as informaticai bits 
move in such memories. The class of ECMs incltides dyna- 
mic and static shift -registers (SRs) as well. The clock 
rate is asstmied to be constant in a dynamic SR, whereas 
it can be varied and reduced to zero in a static SR. We 
use the terms IGM and SR interchangeably. 

Buffering of ECM will lead to a memory organization 
whose performance approaches that of a high speed memory and 
price/bit that of an unbuffered ECM. The theme of this 
thesis is the design of buffered ECMs which can be used with 
various access disciplines, namely FIFO and LIFO. 



¥e consider an EC3M of size N with a transfer rate 
of 1/Tg* Requests are assumed to follow Poisson distri- 
bution. The mean inter-request time is denoted by MHP. 

¥e define the performance-improvement (ii) of a buffered 
ECM over an unbuffered ECM as (1 - T^/T^) x 100 percent, 
where T^ and T^ are the mean access times in the buffered 
and the imbuffered ECMs respectively. 

¥e propose two schemes for designing buffered 
stack memories, one of them uses an ECM with a fixed 
clock rate, and the other uses an BGM with a variable 
clock rate. Mathematical models are developed for both 
the schemes. Eor the buffered stack memory with variable 
clock ECM, a queueing model is developed from first 
princii)les. Simulation study was carried out for both. 
Simulation results agree well with the analytical 
results. A performance-improvement (n) of more than 
95 percent for MRT > 100 T„ is obtained with a buffer 

o 

of size 6 for a constant clock ECM of size 512 whereas 
a buffer of size 12 is required for a variable clock 
ECM of the same size. 

In order to meet the requiremsKrt of a large storage 
with EIPO access discipline in various digital systems and 
to satisfy random request arrivals, we propose a buffered 
ECM organization with two small size buffers on input and 



output ends of the ECM. ECM with constant clock rate is 
preferred as an additional small size shift register is 
required to set a condition to initiate the recirculation 
of information in ECM with variable clock rate. 

¥e propose a mcKJified Skinner's model with a finite 
queue to model the above scheme, and develop a cemputation- 
ally convenient method to analyse this model. Simulation 
was carried out for the above scheme. Analytical values 
for the performance-improvement and the fractional loss 
of information agree with those obtained from the 
simulation. With the constant clock BOM of size 512» 
and mean inter- insert ion time and mean inter- retrieval 
time, MRU >_ 100 addition of buffers of size 6 exhibits 
the performance- improvement of more than 95 percent, and 
buffers of size 12 are required to achieve the fractional 
loss of insertions or retrievals less than 10 

¥e propose a novel technique for multiple insertions 
and deletions in an ICM. A single buffer of size 16 added 
to an ECM yields a mean request-service time less than 
0.75 cycle for mean burst-length < 4. 

Finally, we propose an associative memory 
organization consisting of variable clock BCMs and small 
size buffers- The ability to vary the clock rate and the 
high ratio of data area to key are^ in a record are exploited 
to reduce the size of the content addressable memory (GAM) 
which is an expensive ingredient in this system. 



¥e strongly feel that electronic cyclic memories , 
like CCDs and Magnetic bubbles will emerge as cost- 
effective memories in the design of computer systems. It 
is concliaded in this thesis that with appropriate buffering 
of ECMs, it is possible to design a variety of useful 
memory systems with excellent performance characteristics. 



chapter 1 

INTROPUCTIOy 


1.1 EMEESUrCE OP GAP-PIILER MEMORIES 

Gyst ©ns for information storage and retrieval^ data 

acquisition, non~nxameric processing, and stack oriented 

applications, e.g., multiprocessing, subroutine operations, 

language translation, dynamic programming problems etc. 

require large size monories. Econcmic considerations dictate 

the applicaticm of cheaper storage devices, like magnetic 

disk and drum. However, these have very long access times. 

In fact there is a wide gap of price and performance between 

core/soniconductor random access memory (RAM) and auxiliary 

memory, like magnetic disk and drum. Figure 1.1 is a famous 

price-performance chart given by Feth (1976). Two parallel 

lines have been drawn throiigh the regions of farthest 

advance of various technologies, roughly bracketing the 

prioe-per-blt range from the smallest memory component 

(say, chip) to the price-per-bit paid by the end user for 

a system. The difference is roughly an order of magnitude. 

Slope of these lines is about -0.56, indicating that price- 

—0 56 

per-bit is proportional to T^ * , where is the access 

time to the first bit. T^ denotes the acquisition time 
for a 2-K byte block. 1 projection of price and perfor- 
mance featiares of MOS HAM, gap-filler memories and MED 
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Access tiffie of MOS HiK (approx. 10“" sec.) is 

less than the access time of moving head disk (approx. 

■",—2 

10 sec.) by about four orders of magnitiafle. Fixed head 
disks are at the hipest performance end of magnetic 
recording technolo^. Price-per-bit difference between 
MOS RAM (approx. 0.2 cents/bit) and moving head disk 
(approx. 0.002 cents/bit) is of two orders of magnitiide. 
Bnergence of gap-filler memories, namely. Charge Gougpled 
Devices (CCSDs), Magnetic Bubbles, Domain Tip (DOT) and 
Electron Beam Accessed MOS memory (BBlM or BEAMOS) fills 
this price-performance gap (with access time of approx. 

10 ^ sec. and price approx. 0.02 cents/bit). These are 
faster than mechanically rotating disk and drtaa, and cheaper 
than random-access-memory. Some of the potentially attrac- 
tive features inherent in scxae of these technologies are low 
power requirements, modularity for small capacity, flexibi- 
lity, higher reliability, suitability for non-numeric 
processing, nonvolatility and ease in maintenance. 

1.2 EDECTRONIC CYCLIC MEMORIES 

Infoiraation in a cyclic monoiy rotates cyclically 
past a read /write port. Cyclic memories can be classified 
on the basis of the mobility of information stored in them. 
In magnetic disks ani droms, infoimation bits remain static 
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on a rotating storage medinm. In the case of CCDs, Magnetic 
bubbles and MOS shift-register memories, information bits move 
from one location to another in a static storage medium. This 
intrinsic property of mobility of infonaation can be expleited 
to perform some symbol manipulative operations within memory - 
a further extension of distributed processing concept [liee, 
1963 ]. Hereafter we call a memory of the latter kind as an 
Electrcaiic Cyclic Maaory (BCM). 

Eeth [ 1976 ] presents a stimulating review of l(39s. 
Special issues on EGMs are listed in the bibliography. 

1.2.1 CCDs 

Information is stored as the quantity of charge in 
a potential well in the depletion layer of a MOS structure. 

It is shifted by applying suitably timed clock pulses. GGODs 
operate in a thermal non-equilibrium state, the leakage 
current generate! tends to fill up the charge wells thus 
limits the storage time. These can operate over a range of 
transfer rates, typically 100 KHz to 5 MHz. Since cimrge 
transfer is quite efficient, register lengths of the order 
of 256 bits are practical before refreshing is required, 

Terman and Heller [l976] give an overview of CCD Memories* 

1.2.2 MAGNETIC BUBBLES 

Magnetic bubbles offer data storage by remanent 
magnetization, as ferrite cores do. Fewer masking levels 
withotrt alignment requirements in processing make the coat 
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per bit ccmparable with magnetic disk and drimi. Magnetic 
bubbles are mobile cylindi^ical domains that can be generated 
in a film of a magnetic garnet. These bubbles can be moved 
around by magnetic fields which can be generated by the 
deposition of suitable permalloy (soft magnetic material) 
pattern cm the film and the use of an in-plane rotating 
field. Interaction between currents and permalloy patterns 
enables one to generate, expand, contract, annihilate, split, 
switch or detect buobles. Memory and logic operations can 
be performed as a combination of these basic operations. 
Writing is a controlled generation of a bubble. Reading can 
be done optically or by magneto-resistive detection. Bias 
field is generated by permanent magnets whicsh results in 
nonvolatility of the magnetic biibble memory. The shift 
rate ccmimonly used is 100 KHz. Higher rates, such as 1 to 
10 MHz, are postxilated. Salzer [1976], Cohen [1975] snd 
Bobeck ,et .al. [1975] give a good account of Magnetic Bubble 
Memory devices and their applications. Juliussen [1976] 
disc\isses the bubble memory systems and their advantages 
based on device characteristics and their unusiial system- 
level properties. 

1 . 2.5 MOS SHIJT-REGISTBRS 

There are two kinds of MOS shift-registers (SRs), 
namejy, static ard dynamic. Static MOS SR employs bistable 
flip-flop as a tesic storage element. A current sensing 
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amplifier detects the signal ciirrent. In dynamic MOS-‘?R, 
information is maintained by charge storage. Charge on 
gate capacitance may leak through the junction capacitance 
which will limit effective charge-storage to a finite time 
period. Hence a dynamic charge-storage KOS SR memory needs 
periodic refreshing. Dynamic MOS SRa offer higher speed, 
less power dissipation and higher packing density than a 
static MOS SRs. Hoff and Mazor [1971] discuss the 
operation and the applications of MOS shift-registers 
from the designer’s view point. 

1.2.4 IMPACf OF ICMs 

Wensley [l975] explores the impact of Blectronic 
Cyclic Memories on computer architectirre. He mentions, in 
particular, multiple minicomputers using a large central 
ECM, main memory acting as cache to BCM, spooling of all 
I/O operations by ECM, improved response of in'^eractive 
data-base systems using superior access speed BCMs. 
Panigrahi [1976] places ECM at the third hierarchical level 
between MOS RAM and magnetic disk back up memory. Improve- 
ment in hierarchy performance results by virtue of disk 
buffering. 

1.3 BUPFERIHG OF ECM 

By a buffer memory we mean an intermeiiate digital 
store in which data is temporarily retained prior to 
routing to the final destination. It matches the data 
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rates of source{s} and aeceptor(s) with an objective to 
minimize the loss of information. It reduces the overheads 
expressed in terms of CPU-idl e- waiting time, interrupt 
process overhead, main memory cyclic stealing, etc. Finally , 
it synchronizes otherwise asynchronized source(s) and 
acceptor(s) [Philokyprou, 1975]* 

Sluck [1965] proposed scratchpad buffer memory 
consisting of CPU working registers, status registers, I/O 
control registers, and CPU local data buffer. This buffer 
memory is addressable by user programs. Wilkes [1965] 
proposed slave memory for'virtual to absolute address* 
conversion, request queues, instruction and operand 
lookahead, etc. This bixffer memory is invisible to user 
program. 

Hahn [1968] classifies buffer manories into three 
categories, namely: (a) regularizera (i.e. derandomizers), 

(b) accumtilators (i.e. reservoirs) and (c) synchronizers. 

He uses simulation as a design technique to design a 
buffer storage for multichannel neutron time-of-flight 
analysis. He makes use of multiple levels of buffering. 

Philokyprou [1975] attonpts to classify the buffer 
memories according to the service discipline, namely: 

(a) FIFO (Urst-In-First-Out ), (b) LIFO (Last-In- First-Out ), 

(c) ^IUin)OM, (d) PRIOEITI and (e) ORTHOGONAL. 
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1.3.1 EFFICIENT I}f?B OF ICM 

cheaper and faster electronic cyclic memories compete 
with low capacity magnetic disks when the price of the disk- 
controller is included (Figure 1.3). A microcomputer 
design with EGM has heen prcpos^ by Kartashev [l976]. 

There are numerous applications of electronic cyclic 
memories with various access disciplines, namely FIFO, LIFO, 
etc. 

This motivates us to propose efficient schemes of 
buffered EC5M to be used with various access disciplines, 
and to develop analytic design techniques. The intrinsic 
property of the mobility of information in BCM can be 
exploited to develop seme elegant schemes for widely used 
large size stacks and non-numeric applications. 

¥e propose schemes of buffered electronic cyclic 
memories with FIFO and LIFO access disciplines and analyse 
them. !^ch schemes and related analytic . design techniques 
are the main contributions of this thesis. A novel technique 
for insertion and retrieval of a string results in an improv- 
ed stack memoiy organization and a scheme for symbol manipu- 
lation. Hoff and Mazor [1971] propose an elementary scheme 
for adding and deleting data in sMft-regiater memory. 

Single bit can be inserted during a shift-period. Deletion 
operation results in packets of vacant positions. Our 
scheme enables multiple insertions during a shift period. 
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Multiple deletions can "be performed without creating 
packets of vacant positions. 

¥e propose a scheme for an inexpensive associative 
ECM which may be useful for interactive data-base systems. 
Improvement over Slotnick's logic-per-track concept [1970] 
results by virtue of the ability to vary clock rates in 
ECM. The proposed scheme will be less expensive than 

Slotnick’s on account of reduction in the size of content 

» 

addressable memory which is a very expensive ingredient. 

1.4 GENERAL SPAPa^EHT OP PROBLEMS 

We propose two schemes for buffered stack memory. 

In one scheme, it is assumed that the clock rate of ECM can 
be varied and reduced to zero as found in magnetic 
bubbles and MOS static shift-registers. In another scheme, 
the clock rate of ECM is a^isumed to be constant. The 
scheme for buffered serial storage employe ECM with constant 
clock rate. The ability to vary the clock rate facilitates 
the reduction in power consumption. However, in moat of 
the applications, the assumption of constant clock rate 
resialts in a simplified design of the logic circviitry and 
the better system performance. 

Requests for insertion and retrieval are considered 
to follow Poisson distribution. Inter-request times are 
expressed in teime W SR shift period, Tg. Requests are 
asstaoed to be satisfied instantaneously from the buffer. 
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Analysis is carried out with an assumption that a single 
word is either inserted or retrieve. 

The terms -- SR and E(M, are used interchangeably. 
Assuming an old familiarity of the reader with shift-registers, 
the term, SR is frequently used in the text and on the diagrams 

1.5 AH OVERVIEW OP MATHEMATICAl MODELS FOR BUPPER-DESISH 

There is extensive literature on the behaviour of 
variotis buffer systems. In particular, H.M. Dor [1967] 
considers Poisson input and ccnstant output rates. He 
considers an imbedded Markov chain to develop the queueing 
model. Given buffer-length, and input and output rates, 
mean information-loss due to buffer- overflow is computed. 

W. Chu [ 1970 ] considers Poisson input and multiple {less 
than or equal to a fixed number) outputs at constant rate. 

He computes the overflow probability as the ratio of the 
difference of offered load and carried load to the offered 
load. D.G. Maritsas and K-G. Hartley [l970j ccmsider Si/D/l 
model. S. Tzafestas and G. Philokyprou [l969» 1970] consider 
En/PD/1 and M/PD/1 models. M denotes Poisscai arrival, Si 
Erlangian arrival, D constant service distribution, 

PD periodically consta|i|; service distribution, and 
single server. They find the fractional loss of informa- 
tion (Rj^) and the mean queue length (q) for a buffer of 
length L and givq the following overflow condition, 
iCl-Rj.) < Irl. CoB^utatiom of is based on Dor's woik: 

[1967]. In the above studies, the buffer is ccnaidered 
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word organized as each word (data-tinit) is treated 
separately. Each word enters the buffer and then it 
propagates through the buffer until it joins the end of 
the existing queue. 0^ Halatais et.al.[l974] study the 
statistical behaviour of the block organized buffer mder 
Poisson input and periodic block output. Blocks are 
considered of fixed size (k), A buffer consists of » such 
blocks. Their analysis differs from Chu's [l970] only in 
queueing discipline, i.e. the former is for block- or^niz^d. 
buffer whereas the latter is for word-organized buffer. 

A si m i l ar model is proposed by B. Powell and B. Avi-Itzhak 
[1967]. In this model, there is no restriction chi queue 
length. 

H. Asai and S.C. Lee [1975] consider time-varying 
probability distribution function of data generation, and 
ccxQpute the probability of congestion in a buffered system. 

G. Philokyprou and D.G. Maritsas [1974] cmsider heterogeneous 
input and output processes. In their model an input burst is 
followed by an output burst. No output occiars while an 
input is in progress. No new input burst can occur before 
an output burst has taken place. They derive a relation 
between buffer length and overflow probability. Their 
development is based on Chu’s work [l970], 

AH these buffer designs are for PIPO service 
discipline* They are developed around the concept of 
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imbedded Markov chain. D.K.G. Wishart [i960] stiadies 
distributions of queue length and busy periods, and 
probability of congestion in M/G/1 queueing model with 
IIPO service discipline, and compares the results with 
those obtained in the case of FIFO service discipline. 

^o restriction on queue length is assumed in this model. 

Recently Skinner's model [1967] has been widely 
used in the analysis of drum storage units [s.H. Fuller 
and F. Baskett, 1975], and magnetic bubble manories 
[D.P. Bhandarkar, 1975]. Skinner [1967] proposes a 
queueing model with server- walking time. He considers 
unlimited queue length. Server inspects the queue and 
takes one of two decisions: if queue is empty, server 
beccmies inactive for a variable time, and if queue is 
nonempty, server begins a ’service- inspection* cycle. 

In this model, the points in time that eujoy the Markovian 
property are those instants at which the server inspects 
the queue. 

¥e propose a canputationally convenient method for 
the analysis of the buffered BCM with FIFO access disci- 
pline. A modified Skinner’s mcxiel is used which considers 
a queue of finite length in contrast with the classic 
Skinner’s model [1967] with no restriction on queue length. 
U.H.Bhat [1973] suggests computationally convenient 
analytic methods for the analysis of M/G/1 and G/M/s 
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finite queueing systems, where 6 denotes general distribution 
and s the number of servers. 

We develop an M/M/1 model from first principles 
for the snslysis of the buffered variable clock ECM (e.g,, 
static SR) with LIPO access discipline. 

1.6 SUM*iARY OP DISSERTATION 

This dissertation presents the schemes of buffered 
electronic cyclic memories with various access disciplines, 
namely, PIPO and LIPO. A novel technique for insertion aikl 
deletion in ECM is developed. This dissertation also analyzes 
these schemes so as to provide a designer with the analytical 
tools to prepare a design guide table. A brief outline of 
this dissertation is given below. 

Pollowing this introduction, Chapter 2 presents the 
design of a buffered serial memory consisting of a large 
size electronic cyclic memory and two small size buffers on 
both ends of the cyclic memory to cater to the input and 
output requirements. The design criteria to determine 
the buffer-length are the fractional loss of information on 
input and output ends and the improvement in performance 
over an"tmbuffered 

The input and the output biiffers are analysed iiMiependen- 
tly. Interaction between a buffer and the ICM takes place after 
a fixed period of time as the information is assumed to 
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recirculate at a fixed transfer- rate, A finite queue 
Skinni'r's model is used t ^ obtain the utilization of 
buffer. Simulation is conducted to verify the analytical 
results. It is concluded in this chapter that by adding 
buffers of size 6 bits to an SCM of 512 bits, the terminal 
performance of the reaxiltant memory tends to that of ideal 
FIFO memory of length 512 with 95 percent perfonnance- 
improvement for Mi^ 2.100 T^- 

Chapter 3 disctisses the design of a buffer^ stack 
memory consisting of an ECM with variable clock z^te 
and a small size buffer (fast-stack). Analytic, as well as 
simulation, results are presented to quantify the perfoi^ 
mance of the stack memory organization. Measure of 
perfonnance is cemsidered the improvement in access time 
over an entirely unidirectional SH. A mathematical model 
is developed from first principles. It is concltzded 
in this chapter that by adding a buffer t)f 12 bits to an 
BCM of 512 bits, the terminal performance of the resultant 
stack memory tends to that of ideal stack memory of length 
512 with 95 percent performance improvemait for MET >100 

mm 

Starting with Chapter 4, we investigate the 
possibility of exploiting the property of the mobility of 
information in BCM. Chapter 4 reports a novel technique 
for insertion and deletion of a string in an ECM. A iSHDall 
size buffer is employ^ to perform these operations. Such a 
scheme is useful for non-numeric processing. 
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Chapter 5 presents an interesting design of a 
buffered stack memory ccmsisting of a dyramic SH and a 
small size buffer. Buffer is periodically initialized to 
half of its capacity. Techniques of insertion and deleticaa 
in an ECM (Chapter 4 ) are emplqyed, to perform the initiali- 
zation. Analytic, as well as simvilation, results are 
presented to quantify the perfonaance of the scheme. 
Improvement {in acoess time) over an entirely liiidirec- 
tional SR is considered as the measure of performance. 
Strikingly, the improvement rises faster with an increase 
in buffer-^length(cf.Chapter 3)* It is concltded in this 
chapter that by adding a buffer of size 6 bits to an ECM 
of length 512 , the terminal performance of the resultant 
stack memory organization tends to that of ideal stack 
memory with more than 95 percent performance- improvement 

for MRT > 100 T . 

s 

CJhapter 6 describes a conceptual organization for 
an associative electronic cyclic memory. Ability to vaiy 
clock rate and the knowledge that the ratio of data area 
to key area is large are employed to reduce the size of 
Content Addressable Memory. An associative operation is 
performed in two phases, namely search and follow-up 
operations. Addition of a set of parallel- in-serial- out 
shift-registers enables one to perform string manipulative 
operations. Multiple (sequential) search operatitms over a 
woi^ during a single shift period enhances the parallelism. 

Chapter 7 summarizes the work of this dissert atiem. 
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BUFFERED SERIAL MEMORY WITH CLOCK ECM 

A scheme is proposed for a buffered serial memory 
consisting of a large size ECM and two small size buffers 
on both of the ends of the ECM. In order to determine the 
optimal length of buffers, two design criteria, namely the 
perfonnance- improvement over an unbuffered FIFO storage and 
the fractional loss of information due to biiffes?- overflow- 
are considered. 

2.1 IirrRQDUCTION 

A First“In~First-Out (FIFO) serial storage is used 
in many areas of digital system design. It serves as a 
reservoir when the retrieval rate is less than the average 
insertion rate over a (large) period of time. Electronic 
cyclic memories, e.g., MOS shift-register, CCDs and magnetic 
bubbles form a FIFO serial storage. We use the terms 
Electronic Cyclic Memory (ECM), and Shift-Register (SR) 
interchangeably. 

Insertion and retrieval request patterns to the 
storage are assumed to be independent random events. If 
insertion is made directly into SR at the instant of its 
arrival, packets of holes (vacant positions) will be 
created within SR resulting in the problem of identification 
of first data entry and of garbage collection as data is 
scattered over the entire length of SR. A retrieval request 
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cannot be satisfied until the first data entry is detected. 
Consecutive insertions will, in turn, facilitate consecutive 
retrieval of data. 

Hahn proposed a multi-level buffer storage schaoae 
for neutrem time-of-flight analysis [Hahn, 1968]# A 
typical three level buffer storage system by Hahn includes 
ultra high speed, high speed, and slow speed buffers. 
Internal structure of buffers is not discussed. ’Insertion 
into* takes higher priority over 'retrieval from' the high 
speed buffer. 

We propose a scheme to realize a FIFO storage 
consisting of a large size unidirectional shift-register 
memory (ECM) and two small size buffers on input and 
output ends. Buffers on input and output ends facilitate, 
independently, fast and simultaneous insertion and retrieval 
respectively. The input buffer is emptied into SR and 
similarly, the output buffer is filled from SR either 
periodically or at +he instants determined by some 
condition. Once the tail, that is the last entry, of 
the information-chunk (ic) leaves the input port, it can 
be back after an interval equal to N or more shift-periais. 
The same holds for the head, that is the first entry, of 
ic. The length of the buffers should be large mough so 
that the performance of the buffered FIFO storage 
(Figure 2.1(b)) approaches that of an ideal FIFO storage 
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consisting of an array of registers (Figure 2«l(a)) and 
its price/bit tends to that of the entirely unidirectional 
SR FIFO storage (Figure 2.1l,c}}. 

If the clock-rate is constant, the tail and the head 
of ic reach the input and the output ports respectively at 
a fixed interval of the recirculation period. 

Ihe ability to vary the clock rate of .an BGM, like 
static shift -registers and CCDs appear an attractive 
property. However, this will require a third buffer in 
order to provide a condition to initiate the recirculaticai 
of SR. This scheme may be desirable where reduction in 
power-dissipation is prime importance. Conaideraticai of 
constant clock-rate simplifies the logic circuitry and 
only two buffers suffice. In the scheme mentioned her e- 
below, we consider BGM with a constant clock rate, e.g.p 
dynamic shift-registers. 

2.2 DESCRIPTION OF THE SCHEME 

m shift-regist ers, each of N-bits, constitute a 
serial storage of N-words, m-bits/word. Figure 2.2 depicts 
a configuration of a single unidirectional dynamic shift- 
register (SR) with two buffers, namely and on input 
and output ends. An additional buffer of the same length 
can be used to store the activity bits for m buffers on 
the input aad and similarly cme on the output end. An 
activity bit is 0 for vacant position and 1 for ncm- vacant 
position. Use of such a buffer simplifies the design of the 
selector-unit (see Figure 2.3)« 
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The input buffer is, essentially, a larallel-In- 
Sorial-Out (PISO) shift-register (typicplly of size 
8-32 bits). The output buffer is a Serial-In-Psrallel- 
Out (SIPO) shift-register. A selector unit is associated 
with each buffer. Selector-1 is a combinatorial circuit 
to activate a switch in order to store an input-data into 
the rightmost vacant position of Sclect03>-2 makes 

the rightmost data location of B 2 available for instant ! 
retrieval. Counter indicates the distance of the tail, 
that is the last entry, of the information-chunk (ic) from 
the input port of SR. Counter indicates the distance of tl 
head, that is the first entry, of ic from the output port oi 
SR. Distance is measured in terms of number of shifts. Bactil 
shift-clock increments the distance by one. The size of * 

I 

ic can be obtained by the magnitude difference between j 

and G^* 

Pllp-flop SW^ is reset vhen SB becomes empty and aet 

when B_ becomes full while SR is still empty. Flip-flop 

SWg la set when SB becomes lull and remains reset otherwise. : 

I 

[i 

2.3 C 3 PERATICm i 

Let H be the size of SR and 1/Tg he its shift-rat o* 
Further, we assume the buffers of equal length, L. Figure 2-2 
depicts a configuration with N=1024 and L=8. 

As a request for insertion arrives, the occupancy 
of is examined. If is found fiall the request is 
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rejected otherwise an insertion is made into the rightmost 
vacant position of which is selected by SELECT Ofi-1 . On 
the other hand, a request for retrieval is rejected if B 2 
is found empty otherwise the rightmost data word stored in 
Bg is retrieval through SELECTOR- 2 and the corresponding 
location in B^ is left as a vacant one after decrementing 
the switch index by one. 

Pat hi (see Figure 2. ) is activated whenever SB 
becomes OTpty* This connects B^, directly to B^ acid data 
from B^ is immediately transferred to B^ until Bg gets ( 
full. As SR is empty, will be set when Bg gets full. 
Pathg is activated whenever is set and the tail of the j 
informat i(Ma-chunk (ic) reaches the input port of SR. Sow, | 

I 

data from B-, is transferred consecutively into SB mtil ! 

I 

B^ is left empty. Path? is activated whenever the head J 
of ic reaches the output port of SB and then data frem SB [ 
is transferred into Bg until Bg gets full. Rest of the j 
data stored in SR keeps recirculating. ! 

Counter Cj is incremented at every shisPt operation | 
during the entire cycle except when Path2 remains activated. 
C^^ stops counting up during the period of consecutive 
transfers fran into SR, It is to be noted that such 
a transfer was initiated when = 0, hence remains 
at zero during the activation-period of Path2. Similarly, 
counter Gg is incremented at every shift operation d tiring 
the entire cycle except when Pathj ramains activatol. Cg 
remains at zero during the activation-period of Path?* 
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2.4 MATHEMATICAL MODEL 

Bequests for insertion and retrieval are assumed to 

follow Poisson distribution which is the simplest arrival 

process facilitating mathematical analysis. Insertion- and 

retrieval requests are equally likely events. The Poisson 

process has been widely studied [w. Feller, 1968]. The 

fundamental properties of a Poisson arrival process are: 

any two time intervals of eqiial length experience an 

arrival with equal probability, and the ntjmber of airivals 

during disjoint intervals are independent random events. 

Inter-arrival times have the negative exponential density 

function Xe where A is the mean arrival rate, and the 

probability of k arrivals during an arbitrary interval of 
(XT)^ —XT 

time, T, is e . Poisson assumption facilitates 

analysis, and under certain conditions may be a reasonable 
approximation to reality. 

The inf ormat ion- chunk (ic) recirculates at a constant 
rate within SR. As the tail of ic reaches the input port 
of SR, the input end buffer is inspected and emptied into 
SR during consecutive shift periods. The tail leaves the 
buffer when it becomes empty and inspects it again after a 
fixed period of time, N.T„» which we call the relaxation 
period. Similarly, as the head of ic reaches the output 
of SR, the output end buffer is inspected and filled from 
SR during consecutive shift periods. The head leaves the 
buffer when it becomes full and inspects it again after the 
relaxation period, H»Tg* 
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A request for insertion is rejected when the input- 
buffer is found full. Similarly, a request for retrieval 
is rejected when the output buffer is found empty, 

2 . 5 DBS I GN METP HOD OLOGT 
2.5.1 SIZE OF SR 

Requirement for a large size shiffc-regist er is felt in 
the case of heterogeneous input and output processes. Para- 
meters in a homogeneous process remain constant in time, 
whereas they vary with time in a heterogeneous process 
C G.Philokyprou, 1974]. 

let 1 and ji^ be the mean request rates for insertion, 
and retrieval respectively over an interval of time t^ (see 

Figure 2.4). We may, possibly, define a period of insertion ! 

\ 

and retrieval patterns that repeats, let such a period be 
T consisting of k subintervals. 

k ' I 

T = I t . 
i=l ^ 

For heterogeneous input and output processes, 

N > max (2^) for J = l,2,...,k 

where Z- = f ^ ^ )t . 

3 111 

2.5-2 BIZB OF BUFFER 

In order to determine the optimal size of a buffer 
on input or output end, we define two measures of perfor- 
mance, namely mean access time i'S^) and fractional loss of 
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information To illustrate with an example, w© conside; 

the input buffer. Requests for insertion are satisfied 
instantaneously mtil the input buffer gets full. In the 
case where a request for insertion is required to wait wh^ 
the buffer is full, the mean access time is a suitable 
measure of performance. For a buffer of length L, the mean 
aoceSB tine (Tj,) is computed as the ratio of waiting time 
due to {l+l)-th request to the expected number of requests 
serviced diaring the relaxation period and the period of 
data-transf er from buffer to SR. Further, we define 
performance- improvement (p) as (1-T^/T^)xl00 percent, 

[■ 

where is the mean access time of an tinbuffered serial ; 
storage. 1 

There are many applications in which the fractional [ 
loss of information is a suitable measure of performance- j 
In this case, requests arriving after the buffer gets full 
are lost. ISalike the previous case, the generation of 
requests is not affected by the occurrence of buffer- 
overflow. The fractional loss of information (insertion- 
requests) is computed as the ratio of the number of requests 
lost to the total number of requests during the relaxation 
period and the period of data-transfer between buffer and SR. 

Input and output buffers interact with each other only 
when Pathl is activated which is a rare event. Hence both 
the buffers are assiamed to function independently. A 
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buffer is inspected and information is transferred during 
consecutive shift-periods until the condition to initiate 
the period of inactivity (relaxation) is reached. The 
relaxation period starts with a fixed initial conditicai 
and lasts for a fixed period of time. Wc, therefore, 
study the requests arriving at a buffer during the 
relaxation period. 

Design-guide tables are prepared for both ti and 
Rj^. Given h and MM?, one can obtain an optimal buffer- 
length from these tables so as to achieve D greater than 
a specified value and less than a specified ratio. 

2.6 PERFORMAilCE- IMPROVEMENT 

An ideal FIFO storage is an array of registers 
(refer to Figure 2.1(a)). Such a storage offers a zero 
mean access time. On the contrary, a FIFO storage 
consisting of entirely unidirectional SRs (Figure 2.1(c)) 
suffers from a large mean access time, that is l.T_/2 

O' 

where N is the size of 55R with a transfer-rate of l/r_- 

O 

A buffered FIFO storage (Figure 2.1(b)) shows the mean 
access time between these two ex+reme values. Ve define 
the performance-improvement (rj) of a buffered FIFO storage 
over an unbuffered SR FIFO storage as follows. 

T, 

T) = (1 - ^) X 100 percent 
^u 

where T^ = mean access time of the buffered FIFO storage, 
and T^ = mean access time of the unbuffered SR FIFO 


storage. 
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In tlie case of an ideal FIFO storage, = 0 and 

D 

hence t) = 100 percent. For an unbuffered SR FIFO storage, 
and hence a 0 percent, that iseans no improveoent. 

Probability distribution function of the time, t, 
for the n-th request is computed fr<» the nth convolution 
of probability density function of inter-request intervals. 


^{t < -c) = / 

o 



Mean waiting time of the n-th request during the relaxation 

I*- 


period, l.T , 

s 


NT 

/ HT 


® (n-1 ) I 



In our example, (L+l)-th request waits. Hence, 

J* 

j = o 3.* 

(I+I) 1+1 ^HT -XNT 

- -5— [1 - I — 3f— e M 

^ js* 0 ^ • 

[H.M. l^agner, 1973] 

Let r be the expected number of requests being 

satisfied during the relaxation period and the period of 

data-tran'^fer from btiffer to SR. Probability of btiffer- 

overflow, P^^t can be computed as follows. 


Cl - I 


1 -%*!? 
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Ve can write 

» k[A (L+k-l)T 

r = (L+1) + I S— exp[-X{L+k-l)Tg] 

k— i 

Hence the mean accees time, 

’’b = ®L+l/ ' 

and the perforroance-improveaent , 

11 = (1 - I 100 percent. 

The analyticalvalucs for the performance-improvement 

(t)) are recorded in Table 2.1 for N = 512, 1024 and, 2048. 

L = 2, 4, 6, 8, 12, 16 and 24; and MHT/f = 25,50,75,100,150, 

200, 250 , 300 , 350 and 4CK5. To illustrate with an exai^le, 

T) for L = 8. and MOT = 100 T can bo obtained from this table 

® * 

as 99.7 percent for N = 512 and 95.8 percent for S =< 1024* 

It is to be noted that n depends on the product 
p = XHT , The values for t) are fairly close within an 

O 

accuracy of 0.5 percent for a specif iod value of p obtained 
from different pairs of N and MOT. We define a constant f 
equal to 10.24. P can be expressed in terms of f, e.g., 
p = 5.12 = f/2 for (N « 512, MIT = 100) as well as (N » 1024, 
MOT = 200). is tabulat^J in Table 2.2 for different 




pairs of M and MOT 
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TABLE 2.1 Performance-improvement (t)) of a buffer«l 
serial memory Kith constant clock BCM. 
Analytical values, 

N = 512 


MRT (x T„) > 

8 


1 \ 

25 

50 

75 

100 

150 

200 

250 

300 

350 

400 

2 

43.9 

53.2 

62.4 

70.4 

81.9 

88.5 

92.4 

94 • 7 : 

96.2 

97.2 

4 

70.3 

89.6 

87.5 

92,6 

97.4 

98.9 

99.5 

99.8 

99.9 

99.9 

6 

81.5 

90.4 

96.0 

98.4 

99.7 

99.9 

100.0 

100.0 

100.0 100.0 

8 

87.8 

95.8 

98.9 

99.7 

100.0 

100.0 





12 

94.4 

99.4 

100.0 

100.0 







16 

97.7 

100.0 









24 

99.8 

100.0 









H = 1024 

2 

39.1 

43.5 

48.2 

53.0 

62.3 

70.4 

76.9 

81.9 

85.6 

88.5 

4 

65.5 

70.0 

74.8 

79.5 

87.4 

92.6 

95.6 

97.4 

98-3 

98.9 

6 

76.7 

81.4 

86.1 

90.4 

96.0 

98.4 

99.3 

99.7 

99.9 

99.9 

8 

83.0 

87.7 

92.2 

95.8 

98.9 

99.7 

99.9 

lOO.t) 100.0 

100. C 

12 

89.7 

94.4 

98.0 

99.4 

100.0 

100.0 

100.0 




16 

93-2 

97.7 

99.7 

100.0 







24 

96.9 

99.8 

100.0 












K 






2 

36.7 

38.7 

40.9 

43-3 

48.1 

52.9 

57.7 

62.2 

66 • 5 

77.3 

4 

63-1 

65.2 

67.5 

69.9 

74.7 

79.4 

83.7 

87.4 

90.3 

92 « 6 : 

6 

74.4 

76.5 

78.9 

81.3 

86.1 

90.4 

93.7 

96.0 

97.5 

98.4 

8 

80.6 

82.8 

85.2 

87.6 

92.2 

95.8 

97.9 

98.9 

99.5 

99.7 

12 

87.3 

89.6 

92.0 

94.4 

98,0 

99 « 4 

99.9 

100.0 



16 

90.8 

93.2 

95.6 

97.7 

99.7 

100,0 





24 

94.5 

96.9 

99.0 

99.8 

100.0 







m 
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TABJbE 2. 2 



P 

in 

terms of 

f. 




'^RT(x T^) 

25 

50 

75 

100 

150 

200 

300 

400 

512 

2f 

f 

|f 

3 


if 

3 

4* 

if 

r 

1 

¥ 

1024 

4f 

2f 

4^ 

3 

f 

3 

1^ 

-f 

2 

if 

3 

If 

4 

2048 

8f 

4f 

Sf 

3 

2f 

3 

f 

■|f 

3 



2.7 PRACTIONAi LOSS CF INFORMATION 

In order to determine the fractional loss of 
information, we define two terms, namely, offered load (0I») 
and carried load (CL). OL is the expected number of requests 
arriving during the relaxation period, N.T^, and ^ the period 
of data transfer between buffer and SR. CL is the expected 
number of requests which are satisfied. We can define the 
fractional loss of information, say Rj^, as follows: 

R^ « (OL - CL)/OL 

It is to be noticed that the probability of two or 
more requests arriving during a shift-period, T^, is less 
than 10 ^ for mean inter-arrival times 100. T_ and more. 
Therefore, we can consider that all the requests arriving 
during the period of data transfer between buffer and SR 
afe satisfied and hence there is the loss of information. 
Further we can write i 
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01 = ( 01 )^ + ( 01) 2 
Cl = (Cl)j^ +{C1)^ 

where (01)^ and (Cl)^^ are the offered anc the carried 
loads during the relaxation period, N.!r_* (01) and 

S c 

(Cl ) 2 are the offered and the carried loads during the 
period of data-transfer between buffer and SS. From the 
ass'uaaption that there is no loss of infornaticai during 
this period, (01)^ = (CDg. 

let 1 be the mean arrival -rate of requests for 
insertion and 1 be the length of buffer Relaxation 

period is initiated at the instant gets empty. Inser- 
tions during the relaxation period change the occupancy 
of the buffer. Occupancy of the buffer corresponds to the 
carried load. ¥e define, 

b^ = Brob[i items in the buffer, B^, at the end of 
the relaxation period]. 

(Xm ) 

bi = — „ g - cxp(- XNfg) for i = 0,1, ...,(1-1) 

and 

« (X NT )^ 

®1 “ I ^ gyS — exp(- ^KTg) 

(01)^ = XHTg. 

® h [x(i+k-l)T 

(OL)^ = I I \ ^ — —mL exp[-x(i+k-l)T J 

isl ki 8. ^ 

(ci)^ « (oDg 
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L 

(CL) = I i b 
i=l 

^ + (oDj 

In the case of the output buffer, we define, 

= i>rob[i items in the buffer, at the end 
of the relaxation period] 


= Prob[ (L-i) items leaving the buffer during 
the relaxation period] 

(ODj^ = NTg 

I-l 

(GL)t = I (L-i).c, 
i=o ^ 


where p is the mean arrival rate of retrieval requests, 
can be obtain©3 simply by replacij^ 1 by p in 
the expressions derived for input buffer. Porlss p, 

Cf = will be identical for input and output 

buffers of equal length. 


In the case of the het erogeneoiis process, X or 

' IBSjE 

chosen to cempute R-. 

m&n. Xi 

inalytical values for the fractional loss of 
information, 1^, in the bufferoi serial memory with 
constant clock ECM are recorded in Table 2.3 for N « 512, 
1024 and 2048} L » 2, 4, 6, 8, 12, 16, 20, 24, 28 and 
32} and MBT/Tg = 25, 50, 75. 100, 150, 200, 250. 300 , 
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TABLE 2.3 Fractional Iops of information (Rj^) in, Buffered 




aerial 

memory 

with conatant clock 

BCM • 





Analytical val-jiea. 




MRT(x 

T3) 


N 

= 512 




l\ 25 

50 

75 

100 

150 200 250 

300 

350 

400 

2 -8985 

.8014 

.7056 

.6153 

.4646 .3553 .2776 

. 2217 

.1806 

.1496 

4 .7978 

.6055 

.5304 

.2959 

.1410 .0722 .0400 

.0237 

.0149 

.0098 

6 . 6981 

.4183 

.2136 

.1046 

.0276 .0088 .0033 

.0014 

.00066 

.00034 

8 .5992 

.2548 

.0831 

.0269 

.0036 .00069 .00017 

.00005 

.00002 

.00001 

12 .4053 

.0581 

.0049 

.00073 

.00002 .00000 




16 .2276 

.0064 

. 00017 

. 00001 

.00000 




20 .0959 

.00036 

. 00000 






24 .0280 

. 00001 







28 .0055 

.00000 







32 .0007 











N 

= 1024 




2 .9491 

.9005 

.8518 

.8031 

.1010 .6166 .5355 

. 4654 

.4060 

.3569 

4 .8985 

.8014 

.7042 

. 6081 

.4321 .2970 .2036 

.1414 

.1002 

.0724 

6 . 8481 

.7027 

.5579 

.4208 

.2148 .1051 .0527 

.0277 

.0153 

.0088 

8 .7979 

.6045 

.4156 

.2568 

.0836 .0270 .0094 

.0036 

.0015 

.0007 

12 .6981 

.4106 

.1745 

.0587 

.0059 .00073 .00011 

.00002 

.00000 


16 .5992 

.2315 

.0445 

.0065 

.00017 .00001 .00000 




20 .5011 

.0978 

.0064 

.00036 

.00000 




24 .4039 

.0286 

.00053 

.00001 

. 00000 




28 .3O8I 

.0056 

. 00072 

. 00000 





32 .2163 

.00072 

.00000 









N 

= 2048 




2 .9745 

.9502 

.9258 

.9014 

.8527 .8039 .7555 

.7077 

.6614 

.6171 

4 .9591 

.9005 

.8518 

.8O3I 

.7057 .6093 .5170 

.4329 

.3596 

.2975 

6 .9238 

.8509 

.7779 

.7049 

.5596 .4221 .3052 

.2154 

.1506 

.1054 

8 .8985 

.8014 

.7042 

.6070 

.4179 .2578 .1489 

.0839 

.0473 

. 0271 

12 .8481 

.7027 

.5572 

.4131 

.1755 .0590 .0186 

.0060 

.0020 

. 00073 

16 .7979 

.6044 

.4113 

. 2334 

.0448 .0067 .00099 

.00017 

.00003 

.00001 

20 .7479 

.8066 

.2705 

.0990 

.0065 .00036 .00002 

.00000 



24 .6981 

.4091 

.1485 

.0290 

,00053 .00001 .00000 




28 .6486 

.3128 

.0056 

. 00002 

.00000 




32 .5992 

. 2200 

.0203 

.00073 

.00000 






58 


350 and 400. To illus+ratt. with an example, for 

L = 8 and = 100 T„ can be obtained from this table 

s 

as 0.02686 for II = 512 and 0.2568 for K = 1024. It 
is to be noted that Rj^ also depends on the product 
p = ANTg. The values for Rj^ are fairly close within 
an accuracy of 0.5 percent for a specified value of p 
obtained from different paira of N and MRT. p can be 
expressed in terms of f (refer to Table 2.2). Figure 2.5 
is a plot cf Rj^ versus buffer-length for P = 4f , 2f , f , 

|f, |f and |f. 

2.8 VERIFICATION OF THE ABOVE MODEL 

Discrete-event-simulation [ G. S. Fishman, 1973] 
of the proposed scheme was carried out. A simulation 
program was written in GPSS-III and processed on 
IBM 7044. The whole system is subdivided into two 
practically i3ad spend ent subsystems: (1) Input buffer of 
length L, Poisson arrival, server-relaxation period of 
NT„, and constant service time, T„, (2) Output buffer 
of length 1, constant inte]>-arrival time, T_, source- 
relaxation period of NT^ and exponential service time. 

The first subsystem on the input end can be formulated as 
an M/6/1/L queueing model, i.e., Poisson arrival (M), 

General service time distribT:tion (G), one server (1) and a 
queue of finite length (D). Input to the buffer is assumed to 
be fr cm an infinite source of population. Another subsystem on 
the output end can be formulated as G/M/l/D queueing 
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model, i.e., general inter-arrival time (G), exponential 
service time (M), one se3rver (1) and a queue of finite 
length (L). Data stored in 'JR is the source of input 
to the buffer. It is available to fill the buffer periodi- 
cally. For practical purposes (i.o., for small values of 
1), it can be treated as an infinite source of population. 

In the case of wait-system, an incipient request 
(transacticai) examines the occupancy of buffer (storage 
entity). It gets immediate access if the buffer is not 
full, otherwise the request waits until the relaxation 
period is over. Waiting time for the earlier requests 
is zero. The time lapsed between the generation and the 
service instants is tabulated. Mean of the tabulated 
waiting time is the mean access time (T^) which is used 
to compute the performance- improvement (ti)* Each 
simulation experiment runs for 5000 requests. A request 
is for insertion at the input end, whereas it is for 
retrieval at the output end. Hence MET denotes mean 
int er- insert ion time for M/G/1/1 model and mean inter- 
retrieval time for G/M/l/L model. 

The confidence-interval for the simxilation results 
is computed as follows. For a fairly large sample of 
size 3, the variance of the sample mean is estimated 
as 0^/3 and the distribution of the sample mean can be 
regarded as normal. Given the sample mean, m, and the 
sample- variance, the 95 percent confidence interval, C 
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for the peculation mean lies between m - 1.9Sa/}fS and 
m + 1.96o/fS. obtain mean aceess tine, T^, aiKS standard 
deviation, o, from the sinulation and compute t) and its 
95 percent confidence interval, c = . The 

limit of a confidence interval should be interpreted 
such that Tj never exceeds 100 percent. 

Tables 2.4(a) and (b) compare the analytical 
values of T^ and t] for I = 2,4,6,8,12, and 16; MiiT/Tg=25, 
50, 75, 100, 150, 200, 300 and 400; and N = 512 and 1024. 

Fig^lre 2.6 is a plot of t) versus 1. Solid lines 
correspond to the simulation whereas dashed lines to the 
analytical results. The pair of curves marked with A 
corresponds to KRT = 25T„. Similarly, those marked with 

o 

B and C correspond to MET = 50 T„ and 100 T^ respectively. 
Simxilation results agree well with analytical results. 

The input buffer is formulated as an M/Gr/l/l model 
and the output buffer as a G/M/l/L model. Simulation 
study was carried out for both of these models. The 
values of obtained from the simulation of these models 
are ecaepa red with those obtained analytically. These three 
values of R^ are recorded together as (q, (s), [s]) for 

1 = 2, 4, 8, 12, 16 and 24; and MRT/Tg = 100, 200, 300 

s 

and 400 in Tables 2.5(a) for R = 512, (b) for N = 1024 
aod (c) for R = 2048, q correspcaide to the analytical 
value, (s) and [s] correspond to the values obtained 
from the simulaticm of M/G/l/B and 6/K/l/I* models 



respectively, fhese valiies Batch within an accuracy of 

0. 5 percent. Hence, c’lip.lity of these noc els is verified. 

!lte.ble 2.6 records for L = 2,4,8,12 and 16, and 
N = 512. The input process is considered to be a 
heterogeneous process whereas the output process a 
homogeneous process. MST alternates between two levels,*. 

1. e., X and 3x for heterogeneous process. Each level 
extends until 600 retrieval-requests arrive. MST rKuains 
fixed at 2x for homogeneous process. We consider 

X = ft) T„, 100 T„, 150 T„ and 200 T„. Initial occupancy 
of SE is considered to be H/5. Bach simulation experiment 
runs for 4800 requests. 

When MST is equal to x on the input and and to 2x 

on the output end, insertions are more than retrievals by 

roughly JCX) hence N » 300. Because of the availability 

of SRs in* sizes of 256, 512, 1024 etc., we consider 

N = 512. One can observe from Tables 2.3 and 2-6 that 

the fractional loss of insertion-requests with two-level 

METs, namely x and 3x is close to the average of for 

MET = X and 32 ; respectively. To illustrate with an 

exampiej we consider N = 512 and I» = 4 . In this coufi- 

guraticai, = 0.2959 for MET - 100 \ and = 0.0237 

for MET *= 300 Tg. Ej^ for the heterogeneous input process 

with 100 T_ and 3 OO T„ is tabxalatad as O. 1335 "whereas 
® I ® 

average value is 0.1598. Occupancy of SB reaching the 
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TABLE 2 -4 (a) P er f ona an ce- improvement (t]) of buffered 
serial memory with constant clock BCM. 
Ccmparison of analytical results with 
the simulation results. N = 512. 



SIMULATION 

ANALYT’ICAL 



SIMULATION 

analytical 

MET 
(x T„ 

) ^ 

^b 

T)+C 


9 

MET 

(X 

) ^ 


9±? 

T 

u 

25 

2 

111.9 56.7+4.2 

143.5 

43.9 

50 

2 

90.5 

64.9i4.2 

120.7 

53.2 


4 

65.5 

74 . 7 + 3.8 

76.1 

70.3 


4 

44.9 

82.9±3.9 

52.3 

79.6 


6 

44.6 

82 . 9 + 2.4 

47.3 

81.5 


6 

24.7 

90-7+2-0 

24.5 

90.4 


8 

31.9 

87.9+1.4 

21.3 

87.8 


8 

16.5 

93 . 8 + 1.5 

10.8 

95.8 


12 

16.7 

93.%1.0 

14.3 

94.4 


12 

6.6 

97.7+1.2 

1.4 

99.4 


16 

8.9 

96.9+0,9 

5.8 

97.7 


16 

4.7 

98.5±1.0 

0.1 

100.0 

15 

2 

77.0 

70 . 0 + 3.8 

96.3 

62.4 

100 

2 

68.9 

73.5±4.0 

75.7 

70.4 


4 

33.7 

87.2+2.8 

32.1 

87.5 


4 

26,6 

B9,9±2,9 

18.9 

92.6 


6 

14.9 

94-6+2.0 

10. 1 

96.0 


6 

9.9 

96.5+2.2 

4.1 

98.4 


8 

6.0 

97.%1.4 

2.7 

98.9 


8 

4.0 

98.9±1.1 

0.7 

99.7 


12 

3.0 

99 . 3 + 0.1 

0.1 

100-0 


12 

1.1 

99 . 6^.4 

0.0 

100.0 


16 

0.0 

100 -0.1 

0.0 

100.0 


16 

0.0 

100. 0-0.1 

0.0 

100.0 

150 

2 

52.6 

89,7+3.2 

46.4 

81.9 

200 

2 

37.6 

85.6+2.4 

29.4 

88.5 


4 

13.5 

95 . 0 + 2.6 

6.8 

97.4 


4 

6.2 

97.7+1.6 

2.7 

98.9 


6 

2.7 

99.3±1.2 

0.8 

99.7 


6 

0.8 

99 . 9 + 0.7 

0.2 

99.9 


8 

1.2 

99.7+0.9 

0.1 

100.0 


8 

0.3 

100.0-0.2 

0.0 

100.0 


12 

0.7 

99 . 9 + 0.5 

c.o 

100.0 


12 

0.0 

100,0-0.1 

0.0 

100.0 


16 

0.0 

la -0.1 

o.c 

lOC.O 


16 

6.0 

100.0 

0.0 

100. 0 

jm 

2 

26,8 

89.9*3,7 

13.5 

94.7 

400 

2 

14.3 

94 . 4 + 3.2 

7.2 

97.2 


4 

8.1 

96 , 9 + 3.0 

0.6 

99.8 


4 

" 1.2 

99.7+1.2 

0.2 

99.9 


6 

1.0 

99.7±0.7 

0.0 

100.0 


6 

0.5 

100.0-0.2 

0.0 

100.0 


8 

0,9 

99.9+0.2 

0.0 

100- 0 


8 

0.0 

100.fi 

0.0 

100. 0 


12 

0.2 

100. -0.1 

0.0 

100.0 


12 

0.0 

ICO .0 

0,0 

ICO .0 


16 

0.0 

100 . 0 

0.0 

100.0 


'16 

0.0 

100.0 

0.0 

lOo.O 
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Table 2.4(b) performance-improvement (t^) of biiffered 
aerial memory with constant clock ECM. 
Comparison of analytical results with 
the simulation results. N = 1024 




SIMULATION 

ANALITICAI 


SIMULATION 

anaiyticaST; 

nsT 

(X Tg 

L 

) 

h 



U 

MBT 

(X 

L 

) 


tl+e 

h 

U 

25 

2 

254.6 

54.3+4.0 

311.6 

39.1 

50 

2 

217-1 

57.7±4.0 

289.3 

43.5' 


4 

144.4 

71.9+2.8 

176.7 

65.5 


4 

131.2 

74.5+3.2 

153.5 

70. C 


6 

107.0 

79.3+2.6 

119.0 

76.7 


6 

83.6 

83,8(+2.2 

95.4 

81.4; 


8 

78.1 

84.8^1.8 

87.1 

83.0 


8 

60.8 

88.3+1.3 

63.0 

87.7 


12 

53.1 

89.7±0.7 

52.9 

89.7 


12 

29.7 

94.3±0.8 

28.8 

94.4 


16 

35.4 

93.2+0.4 

34.7 

93.2 


16 

11.7 

97.7+0.4 

11.7 

97.7j 

75 

2 

202.2 

60.6i;5.6 

265.1 

48.2 

100 

2 

186. 2 

63.7+5.2 

240.6 

53. Ci 


4 

111.8 

78.4^2.2 129.1 

74.8 


4 

89.3 

82.7i:2.2 

105.1 

79.5 


6 

66.5 

87.2+1.9 

71.2 

86.1 


6 

48.4 

90.7i:1.4 

49.3 

90,4 


8 

37.1 

92* 8+1# 2 

39.7 

52.2 


8 

25.8 

95.2i:1.3 

21.7 

95.6 


12 

14.7 

97.3±0.9 

10.4 

98.0 


12 

5.3 

99.1±1.0 

2.9 

99.4 


16 

6.1 

98.9±0.8 

1.8 

99.7 


16 

1.8 

99.9±0.5 

0.2 

100. c 

150 

2 

150.0 

70.9+2.7 

193.1 

62.3 

200 

2 128.9 

75.0+3.0 151.6 

70.4 


4 

60.6 

88,5+2.2 

64.4 

87.4 


4 

42.5 

91.8+2.1 

57.8 

92.6 


6 

28.7 

94.6+1.7 

20.3 

96.0 


8 

17.0 

96 # 9+1 • 4 

8.1 

98.4 


8 

10.5 

98.1+1.0 

5.4 

98.9 


8 

2.3 

99.6+0.4 

1,4 

99.7 


12 

6.0 

98.9+0.7 

0.2 

100.0 


12 

0.1 

100. 0-0. 2 

0.0 lOO.C 


16 

4.7 

99.5+0.5 

0.0 

100.0 


16 

0.0 

100.0 

0.0 

100. C 

500 

2 

96. 6 

81.5+2.2 

92.8 

81.9 

400 

2 

78.1 

84.8+2.6 

98.8 

88.5 


4 

21.6 

95.9+1.6 

13.6 

97.4 


4 

14 • 2 

97.3+1.5 

5.5 

98.9 


6 

4.5 

99.3±0.9 

1.5 

99.7 


6 

1.6 

99.9+0.2 

0.4 

99-9 


8 

1.1 

99.9+0.2 

0.1 

100.0 


8 

0.4 

100.0+0.2 

0.0 

100.0 


12 

0.1 

100.0-0.1 

0.0 

100.0 


12 

0.0 

100.0-0.1 

0.0 

100.0 


16 

0.0 

100# 0 

0.0 

100.0 


16 

0.0 

100.0 

0.0 

100.0 
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TaBIiE 2.5 Fractional Iopb of information. 

Comparison of analyticalvalues with simulation 
values of M/G/1/1 and G/M/l/L Models. 


N =: 512 


ME!r(x T) 
8 

1 » 2 

4 

8 

12 

16 24 


.6153 

.2959 

.0269 

.0007 

.0000 

100 

(.6122) 

(.2934) 

(.0298) 

(.0010) 

(.0000) 


[ . 6148] 

[.3007] 

[-0287] 

[.0006] 

[.0000] 


.5553 

.0722 

.0007 

.0000 


200 

(.5493) 

(.0706) 

(.0008) 

(.0000) 



[.3593] 

[.0730] 

[.0005] 

[ .0000] 



; . 2217 

.0257 

,0000 



500 

' (.2213) 

(.0240) 

(.0000) 




[.2213] 

[.0248] 

[ . 0000] 




.1496 

.0098 

.0000 



400 

(.1298) 

(.0080) 

(.0000) 




[.1493] 

1 1 

CD 

Ch 

O 

O 

« 

[.0000] 








TABLE 2.5 (Continued) 
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H = 1024 


MRTCx Tg) 

If «= 2 

4 

8 

12 

16 

24 

100 

.8031 
( . 8031 ) 

[.8046] 

i limai iiniiiii ....1 i., 1 11 . u .,. b . .11. 11 . juu ., - t . 

.6081 

(.6050) 

[.6124] 

. 2568 
(.2517) 
[.2696] 

.0597 

(.0576) 

[-0538] 

.0065 

(.0074) 

[.0081] 

.0000 

(.0000) 

[.0000] 

■ 

.6165 

(.6160) 

[.6142] 

.2970 

(.3006) 

[.3070] 

.0270 

(.0282) 

[ . 0290] 

. 0007 

(.0007) 

[.0006] 

.0000 

(.0000) 

[ .0000] 


500 

.4654 

(.4566) 

[.4624] 

.1414 

(.1396) 

[.1388] 

.0036 

(.0040) 

[.0033] 

.0000 

(.0000) 

[.0000] 



400 

.3559 

(.3680) 

[.3633] 

.0724 

(.0730) 

[.0755] 

.0007 

(.0010) 

[O.OO6J 

.0000 

(.0000) 

[ . 0000] 





N 

m 2048 




100 

r ■ ■ 

.9014 

(.8985) 

[.9016] 

.8O31 

(.8010) 

[.8002] 

.6070 
( . 6005 ) 
[-6122] 

.4131 

(.3905) 

[-4306] 

.2334 

(.2345) 

[ . 2378] 

^0290 

(.0280) 

[.0257] 

200 

.8039 

(.8075) 

[.8024] 

.6093 
(. ao5) 

[ . 6118] 

.2578 

(.2543) 

[.2666] 

'fi 

.0590 

(.0644) 

[.0626] 

.0066 

(.0058) 

[ .0074] 

.{X)00 

(.0000) 

[ . 0000] 

300 

. 7077 

(.7120) 

[.7056] 

.4329 

(.4315) 

[.4346] 

.0840 

(.0840) 

[ .0838] 

.0060 

(.0066) 

[.0070] 

.00017 

(.0000) 

[.0003] 

.0000 

(.0000) 

[,0000] 

400 

: I 

.6171 

(.6205) 

[.6141] 

-2975 

(.3093) 

[.3012] 

.0271 

(.0260) 

[.0225] 

.0007 

(.0006) 

[.0008] 

.0000 

(.0000) 

[ . 0000] 



Simulation results (.) correspcmds to M/G/l/L model and 

[.] correspcmds to G/M/1/1 model. 













TABLE 2*6 Heterogeneoua input and homogeneous 
Output procePBep. 

N = 512» 600 re+:rieval-reque«t per level. 


MET 

CAPACITY 

MEAN 

occupancy 

SR 

occupancy 

LOSS OF 

INFO. 

INS. 

RET. 

Bi 


Bl 

®2 

KEAN 

MAX. 

INS. 

SET. 

50 

100 

2 

2 

1.3 

1.6 

63 

128 

.7175 

.6113 



4 

4 

1.9 

1.6 

71 

192 

• 3478 

.3459 



8 

8 

2.2 

5-1 

119 

320 

.1079 

.0797 



12 

12 

2.5 

9.4 

175 

384 

.0283 

.0034 



• 16 

16 

2.4 

13.5 

204 

448 

.0027 

.0010 

100 

200 

2 

2 

0.9 

1.0 

69 

192 

.3981 

.3637 



4 

4 

1.1 

2.6 

102 

320 

.1335 

.1138 



8 

8 

1.1 

6.5 

176 

384 

.0131 

.0195 



12 

12 

1.2 

10.9 

257 

512 

.0004 

.0002 

150 

500 

2 

2 

0.6 

1.2 

63 

256 

. 2635 

.2531 



4 

4 

0.7 

3.2 

192 

448 

.0589 

.0331 



8 

8 

0.8 

7.2 

205 

384 

.0004 

.0008 



12 

12 

0.8 

11.1 

227 

448 

.0000 

.0000 

200 

400 

2 

2 

0.5 

1.4 

108 

256 

.1779 

.1588 



4 

4 

0.5 

3.5 

217 

448 

.0277 

.0068 



8 

8 

0.6 

7.5 

262 

512 

.(X)02 

.0005 
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value of its capacity can also be observefd from 
Table 2*6* For the buffers of equal length, the insertion- 
loss is more than the retrieval-loss. This b^ioves us to 
increase the size of the input buffer. 

2.9 BUFFER UTIlIZATICll 

In order to find the mean occupancy of buffer, we 
can employ Skinner’s model [C.B. Skinner, 1967] with a 
constraint that the queue is of finite length. Skinner 
considered a problem where the server walks away for a 
period of time when the queue becomes empty. Customers 
join a queue which may be of any length. We develop a 
computationally convenient analytic method for the analysis 
of two classes of finite queueing systems, namely, M/G/l/h, 
i.e.^oisson arrivals (M), general service time (G), tme 
server (1) and finite queue (h)j and G/M/l/X, i.e. general 
arrival (G), erpemential service time (M), one server (l) 
and finite queue (L). 

A customer corresponds to a request. Similarly a 
queue corresponds to a buffer. Service in M/G/l/If 
corresponds to transfer from BR. Arrival in 

G/M/l/I. corresponds to transfer from SR to Bg, and service 
time corresponds to inter-arrival time for retrieval-requests. 

2.9.1 M/G A/I WITH SBRVER-RBLAXAT I ON TIME 

is emptied into SR during consecutive shift- 
periods. The tail of the informat ion- chunk in SR leaves 




SUFFEI?-:1 


BiJFrER - 2 


Arrival 




Figur*' 2.7 


PINITO QUEUE SKIflllER'S MODELS FOR 

(«)■ M/G/i/L mm siRVER-RijyucAtioM ttm 

CL) G/n/l/L WITH SOURCE^-RSLiycIltTOli Tlf4» 







50 


the input port and inspects again after the relaxation 
period» Figure 2.1{a) depicts a finite queue Scinner's 

model [cf- G.E. !?kinner, 1967]. In our exsmple, relaxation 
period R = NT„» service time A = and latency period B = 0. 
Eequerta for insertion are rejeoted if is found full. 
Insertion requests follow Poisson distribution with mean 
arrival rate of ¥e define* 


and 


= Proh[i infonnation bits in B^ at the beginning 
of transfer from B^ "to SR] 


h^ = Prob[i information bits left in B^ at the instant 


of a transfer-completion] 


let pjj = IHT^, = XT^ 


-P„ P 


i+1 


Si = «o ® 


N N 


+ I 


il k=l 


®k 


e 


.0 p f+l—k 
11 


(i+l-k).‘ 


for i = 0,1, , (L-1) 


and 

o* —0 

„ _ „ T „ N 
% “ So . \ ® 

3=1 



li 

I gj, I 
k=l ^ j=l-k+l 




The last equation is linearly dependent on the previous 

ones aiKl hence can be replaced by the probability conses?- 

1 

vat ion constraint \ gr = !• The resulting set of 

i=c 

equations can be solved by iterative method. 
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After computing g , g , . 

O X 

follows; 


■”1 

kii I-So 


• » gj^» we can ccanpute as 

pi+1-1 

1 

{i+l-k)l 


for i = 0,1 


h 


1-1 


kll 


^■”®o j =^-i 



(1-2) 


1-2 


= (1 - I 

i=o 



Since insertion-request and transfer operation are indepen- 
dent, also denotes the probability of i-information bits 
in at the in.taht of arrival of an Insert ion-requeat. 
Thus, we can compute the mean buffer occupancy, as 

follows. 



1-1 

I (i+l)h^ . 


te* Mbl d\ 
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2.9.2 G/M/1/1 WITH SOUECE RfiLAXATION TIMB 

If wc reverse the direction oi flow of requests and. 
transfer of dota, the resulting qu..-ucing system (Figure 2.7(h)) 
is G/M/l/X, i.c., general inter-arrival times (G), exponential 
service times (M), one scivcr (l), and finite queue (i). In 
our example, information-chunk (ic) is the source of popula- 
tion, its head inspects buffer (queue). If the buffer is 
not full, service- inspect ion cycle begins. Transfer from 
SR into corresponds to service. Transfer takes places 
during consecutive ahi ft -periods . If the buffer is found 
full, ic begins an inactivity period of E which is the same 
as the relaxation period, NTg. Retrieval requests follow 
Poisson distribution with mean arrival rate of p. 

¥e define, 

= Prob[i infoimation bits in at the beginning of 

the transfer from SR into B j 

2 ' 

and 

hj^ = Prob[i information bits in at the instant 
of a transfer-completion] 

Refer to Figure 2.7(b), relaxation period R = 

s 

inter-arrival time, A = T^ and latency time, B = 0, Let 
- pHTg and = pT^. ¥e can derive the occupancy 
probabilities frem the following set of eqimtions, 
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(Ir-i), 


kii-1 ^ (k+l-i)! 
for i = 1,2,. . .,1, 


“O, 
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The first eqmtion is depcaident upon the rest of 

the ^auations. Hence it can be replaced by the probabi- 

L 

lity conservation constraint, that is 1 

i*o ^ 
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For = PjL and = p^, we obtain ^ 

and a! . = a,^ ^ \ 

13 (L-i)j 

After revriting the above set of eqiiations, we 
obtain a new set of eq\iations identical to that in the 
case of M/Q/l/lt, This holds for all vrJ.ues of Ii, Thus 
we conclude that M/6/l/Ii queueing system with sejrvci— 
relaxation time is dual of G/M/l/l queueing systea with 
source-relaxation time. 
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table 2.7 M/G/l/L Finite queue Skinner’a Model, K - 512- 

S : Simulation result 
A : ijoalytical result 

(a) MRT = 100 T 

s 

L q i=0 1 2 3 4 5 6 7 8 

2 sri.503 g. r.334 .335 .331 

A11.A97 ^ \.336 .337 ,327 

S f K503 .497 

At ^ 1*503 .497 

4 S ?2-377 g J'.219 .218 .212 .191 .160 

A \2.396 ^ U2I6 .217 .210 .193 .164 

S f ^ r.279 .271 .245 .205 

A I ^ 1.274 .268 .246 .211 

8 S 1*3.390 g r.l66 .16? .161 .148 .12§ .096 .068 .04I .026 

AL5-397 ^ [.166 .166 .161 ,148 .126 . 097 . 068 . 043 . 025- 

S r ^ p200 .193 .177 *151 .115 .082 .050 ,031 

a] ^ 1.197 .193 .178 .151 .117 .082 .052 .030 


0 2 4 6 8 10 12 14 16 


12 3 13.598 

A t3.563 

A 1 


16 3 13.563 

A b.570 


Si 1 
^i 1 

r.i62 

[.162 

^194 

[.192 

.160 

.158 

,171 

.173 

.120 

.123 

.111 

.114 

.070 

.066 

.051 

.050 

.020 

.024 

.016 

.015 

.006 

.006 

.003 

.003 

.0009 

.001 

gf 1 

r.i62 

.157 

.122 

.067 

.024 

.006 

.0009 .0000 .000 

[.162 

.158 

.123 

. 066 

f . Q 24 

.006 

,0010 .0001 .000 

ll t i 

f!l94 

.173 

.114 

.051 

,015 

.003 

. 0004 . 0000 

^i < 

y .92 

.173 

.114 

.050 

.015 

.003 

.0005 .0000 


MIT = Mean InterarriTal time L ** Kaxiaum length of the queue 

q = Mean queue length. 
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TABLE 2.7 (Continued) 


(c) MRT = 5C0 T„ 

O 


S : Simulati on result 
A i Analytical result 



HIT » Mean interarrival time 
q s! Mean queue length 


L = Maximum length of the queue 
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TABLE 2.7 (Continued) 


(d) MET = 400 T 


S : Simulation result 
A i Analytical result 


s 


i ass 


0 


8 



474 

.344 

.182 






478 

.346 

.176 






655 

.347 







662 

.338 







445 

.321 

.160 

.058 

.017 




440 

.312 

.162 

.061 

.018 




557 

.288 

.105 

.030 





558 

.289 

.109 

.033 





457 

.317 

.161 

.060 

.021 

.004 

.000 

.(XX) 

458 

.317 

.161 

.061 

.018 

.004 

.001 

.000 

563 

.285 

.107 

.037 

.007 

.001 

.000 

-000 

563 

.287 

.108 

.032 

.008 

.001 

.000 

.000 

0 

2 

4 

6 

8 

10 

12 

14 

435 

. I 64 

.021 

.001 

.000 

.000 

.000 


458 

.161 

.018 

.0009 

.000 

.000 

.000 


551 

.113 

.008 

.000 

.00 

.000 



563 

.108 

.008 

.0003 

.000 

.000 



436 

,160 

.016 

.001 

.000 

.000 

.000 

.000 

,458 

.161 

.0^8 

.009 

.000 

,000 

.000 

.000 

,568 

.108 

,007 

.0002 

.000 

.000 

.000 

.000 

563 

.1C» 

.008 

.0003 

.000 

.0(X) 

.000 

.000 



% 


8 S ri.644 „ 

A ^1.641 ^ 

S 


.000 

,000 


16 


.2 



Si 


L6 



Si 


h. 

X 


000 


MIT == Mean interarrival time 
i = Mean queue lengtii. 


L = Maximum length of the queue 
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2.9.3 VBRIPICATION OF T^HB FINITE QUEUE SKINNER'S MODEE 
In order to verify the analytical results of the 
M/G/l/E finite queue Skinner's model, the simulation 
study of the same is carried out. Bach simtolation 
experiment runs for 5000 requests. Mean buffer occupancy, 
q; and occupancy probabilities, and h^, are recojnied 
in Tables 2.7(a-d) for K = 512; L = 2, 4, 8, 12 end 16; 
MRT/P„ = 100, 200, 300 and 400. 

It is observed that the analytical results agree 
well with those obtained from the simulation. 

2.10 DISCUSSION OF RESULTS 

¥e consider two measures of performance, namely, 
mean access time and fractional loss of infonaation. 

Table 2.1 serves as a design-guide table for the improve- 
ment in performance {in terms of mean access time) as the 
design-criterion. For the fractional loss of information. 
Table 2.3 is to be referred to. '?’ablcs 2.4 aai 2.5 
compare the analytical results with the simulation results 
for performance-improvement (t]) and fractional loss of 

information {R_ ) respectively . 

Jj 

For the anpj.,ticfa values agree .ell with th. 

simulation values. Figtire 2.5 shows an exponential decay 
in Rj^ with an increase in buffer-length. In order to 

-3 

achieve <. 10 one may select L = 6, 8, 12, 20 and 

32 for p = ^f, ^f, f and 2f respectively. (N,IIRT) 

pairs of (512, 50 T ); (1024, 100 Tji (2048, 200 T ) and 
(4096, 400 Tg) give rise to p s= NTg/HBffi » 10.24 « f. 
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For T|, the analytical values arc close to 
the sim\alation vrluea. For L much greater than 
r.nalytical value of n is slightly Eorc than the corres- 
ponding siiEiilation value. For L wuch smaller than 4-* 
analytical value of ti is less than the corresponding 
simulation value. Figure 2.6 shows rapid increase in n 
with small increase in buffer-length for 1 less thanp • 

To illiistrate with an example, we consider N = 512 and 
MIW>=100T„ to provide P= 5.12. An addition of a buffer 

o 

of length 2 exhibits about 70 percent improvanent in 

performance. In order to achieve r) 2:. 95 percent* one 

111 

may select I. = 2 * 4, 6 , 8 and 16 for P = 0 ^» 4 ^* 2 ^» 
f aM 2 f» respectively. 

Table 2.6 records for heterogeneous input 
and homogeneous output processes. Fracticnpl loss of 
insertion requests with altcmsting MET can be approxi- 
mated by the average of for respective MM. Retrieval 
loss is slightly mere than the expected value from the 
Table 2.3* This results because the assumption of 
infinite source of population f ' r G/M/1/1 model does 
not hold good when 31 becomes empty. 

Finite queue Skini.er*s models for M/G/l/h and 
G/M/l/L are simulated. The analytical results agree 
well with the simulation results. 
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2.11 appmcations 

The prcpcpcd buffered serial ncncry is useful 
with heterogeneous input and output processes [G.PIiilckyprou» 
1974]. Typicr.l application areas include data compressicaa, 
data acquisition systems, pcripheral-CPU interfacing, time- 
shrred systems, and cemmuniert ion networks with store and 
forward facility [b.C. Chen, et.al., 1976]. 

2.12 G0NGLU3ICM 

A b\iffcred serial memory with constant clock BCKs 
is proposed. The entire system can be studied by analysing 
two practically independent subsystems: (l) input process, 
input buffer and periodic aaptying of buffer, and {2} output 
prccess, output bxiffer and periodic filling of buffer. The 
first subsystem is fcmulated as an M/G/l/L queueing system 
with server-relaxat i<.n time, that is, a finite queue 
Skinner's model; and the second as a G/H/l/L queueing systcaa 
with source-relaxation tire. Both of these models are 
duals. A comput«>t ionally convenient analytic method is 
developed to compute btxffer-uccupancy probabilities and mean 
buffer-length. The improvement in porfcrmance (q) ever an 
unbuffered serial renory, end the fractional loss of 
information (Hj^) are considered the design criteria to 
obtain an optimal length of buffer. Analytical values of 
q and agree well with the simulation resul+s. 



CHAPTER 3 


BUFFERED ^TACK MMORY VARIABLE GEOGK ECM 

Staoks are widely used in various fields, namely, 
multiprocessing, dynamic programming, evaluation of 
rec\irsive fvaactiona etc- Major shortcoming of a stack- 
based machine is a degradation in array processing. A 
buffered stack overcomes this shortcoming to a large extent. 
A scheme for a buffered stack organization io proposed. This 
consists of a J.arge size unidirectional (static) shift- 
register stack (BS) and a small size bidirectional fast 

.(f. 

(buffer) stack (FS). Perforaance-improvament of a buffered 
stack over an unbuffered stack ’consisting of entirely uni- 
directional SRs is considered the design criteri cm to 
determine the size of buffer (PS). 

3-1 IHTEQDUCTICW 

A stack is a storage mit with Last- In-Fir st-Out 
(LIFO) access discipline, stacks can be used in variovis 
ways, nsmely, providing a ’one for one’ correspondence 

■ 

between source language operators and the machine 
instructions [R.J.Bvey, 1963]» preserving code and data 
conditions to facilitate rapid environment change and 
hence saving an overhead for tinlimited nesting of proce- 
dures; providing re-entrant capability of code to 
facilitate recursion; allowing fast interrupt handling; 
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increaping the speed of computation by virtue of code 
comprespion which is inherited frCM its implicit addressing 
[E. I.Organick, 1975; Hauck and Dent, 1968; G.B. Garlon,19653» 
Application of stacks can be noticed in mxiltiprocessing 
subroutine operations, language translation, dynamic 
programming problems, efficient processing of object 
program written in a structured Ismguage like AlGOL, 
and facilitating 60TO-less programming by maintaining a 
control environment [Bvey, 1965; Bums and Savilt, 1973s 
Bigelow, 1975] • 

Stack-tesed machines, like B-6700 suffer in the 
area of array processing where a great deal of indexing 
is involved. Barton [1970] suggests an adeqtiately large 
'fast stack top*, say 52 fast registers on a slow stack, 
in order to solve the problem of great deal of indexing 
encountered in array processing. Salarged fast stack top 
further offers a convenient opportimity to introduce 
hardware supported vector operations. 

A «tack can be simulated by software. However, a 
hardware stack offers a substantial increase in the* speed 
of computation. 

A stack Can be formed by using a set of unidirect- 
ional shift-registers with some logic circuitry associated 
with it. Such a stack forms an asymmetric memory 
[j.G. Williams, 1975] as the ra^io of insertion and 
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retrievel times is sufficiently large, i.e.^eousl to the 
size of stack minus one. A tyj ical stack of size {N+1) 
would require !« circular shifts for an insertion in order 
to make the recently inserted word available cm top of 
the rtack, whereas only one shift is reqiiired for a 
consecutive retrieval* Kean access +ime will be (K+l)/2- 

In this thesis a fart stack which consists of high 
speed registers or bidirectional shift-registers is 
suggested to be added to the unidirectional shift -register 
stack. This will improve the access time by virtue of 
enabling large ntaaber of requerts to be satisfied from 
the fast stack. 

An asymmetric stack memory consisting of an WR of 
size IK operating at 5 MHz exhibits the mean access time 
of 0.1 msec., that is 512 shift-periods. An addition of 
a small size fast stack will reduce it to a few shift- 
periods. 

Processor accesses the stack by generating two 
kinds of requests (see Pigiire 3*1 )i (1) request for 
inserticjn, and (2) request for retrieval. These requests 
occur randomly and may be assumed to follow a particiilar 
probabilistic distribution. 

Our design objective is to develop a schoae so 
as to make the large size asymmetric stack look like an 
entire fart stack [Figure 3.1(a)] in performance. 
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In the following Rectionfl a scheme for a bixffereti 
stack memory is proposed and its mathematical model is 
developed tc determine the optimal size of the fast stack. 

3.2 DEFCRIP7ICN OF THE SCHEI-IE 

Processor issues requests for insertion into and 
retrieval out of the stack memory (see Figure 3»1). 
propose a buffered stack memory organization (Figure 3*2) 
consisting of large size ECMs with variable clock-rate (sta- 
• tic 3Bs)» a small size fast-stack (PS) and the associated 
circuitry, m H-bit shift-registers in parallel can store 
N-words, m-bit/word. These together with S-register form 
an asymmetric slow stack (?'?). m L-bit bidirectional high 
speM shift-registers form a fast stack. Typically, N is 
in the range of 512 to 2048 and L is 8 to 32- Transfer-- 
rate, 1/T , of a shift-register may be 1-5 Mb/S. Processor 
accesses a data word via a buffer-regitier. Bata word is 
transferred from the buffer register into the fast stack 
for insertion. Insertion request is assumed to be rejected 
at the instc-nt the fast stack is found full and the 
processor has to wait until a room is created era the top of 
fast stack. Retrieval request enables the transfer of the 
top of fast stack into the buffer register. If P3 is 
found empty the Processor has to wait until the head of 
the informat ion-ch\jnk (ic) reaches the output port. 

PS cotanter (f.s.c.) indicates the current content of PS. 

PM@ is a flip-flop which is set when a data word is 
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ptored in both FP and FLAG ia reset when initially 

p*ored duplicate data word is shifted out due to a request 
for insertion. and are the wired OR junctions. Ve 
have discussed this scheme in an earlier paper [ ? . Rs j araman 
and Om Vikas, 1975]- 

3.3 OPERATIOH 

Whenever there is any request for insertion or 
retrieval, F3 is inspected and depending on its occupancy 
• it is either satisfied immediately or required to wait 
until S3 is ready for the transfer of a data word- As 
a reque'et for insertion arrives, PS may be found in caie 
of three possible states: empty, fiall and partially filled - 
If FS is found empty, the desired data word is wri‘’'ten 
into PS and the S-register. Subsequently FLAG is set. FS 
is made available immediately for a next request, whereas 
the content of SS will be circularly shifted H times to 
make the recent insertion available on the top of the 
stock. 33 is assumed of size (H+1). There will be 
overlapping of the operations on F3 with a write operation 
into 33. If FS is found full, the flag is examined. If 
flag is foxmd set which implies that the bottem of FS is 
already available as the top of SS, then FS is shifted 
down and no insertion into SS is initiated. At the same 
time the flag is reset. If the flag is foiaad reset and 
PS is full, FS is shifted down into SS provided SS is 
available otherwise the system waits. Subsequently a 



68 


room on the top of F*? in created for a new insertion. If 
FS is partially full, desired data word is inser+ed, 
imoediately, into F®!. 

When a request for retrieval arrives, the top of Ff? 
is transferred to the buffer register provided the PS is 
non-empty. If FS is found anpty, the flag is examina3. If 
the flag ia set which implies a duplicate data word in SS, 
the top of SS is shifted out and the flag is reset. 
Subsequently, the top of SS is transferred into the buffer- 
register. 

3.4 CEIfERlA OF GOODNESS 

The main objective to develop the buffered SE stack 
organization is to achieve the performance which approaches 
that of the buffer and the price/hit which approaches that 
of SB. 

Mean access time can be consid red a measure of 
performance. The size of fe'^t stack should be as small as 
possible in order to reduce the price/bit. toother consi- 
deration may be that the processor waiting time should not 
exceed its allowable waiting time, 

3.4.1 PlRFOBMtoCl-IMPlOVaiEHT 

to ideal stack (Figure 3.1(a)) and an asymmetric 
unbuffered stack (Figure 3.1(c)) lie on the extreme ends 
of the spectrum of the stack memory organization. The 
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performance of a stack memory organization can be 
expressed with respect to that of the stack cemory 
con-isting entirely of the unidirectional shift-registers. 
We define the performance- improvement {r, ) of a buffered 
stack (Figure 3-l(b)) over an unbuffered stack (Figure 3*1 
(c)) as follows. 

^b 

a* (1 - ■=— ) X ICK) percent 
•^u 

where T, » mean access time of the buffered stack, and 

D 

= mean access time of the unbuffered stack. 

In the case of ideal stack, = 0 and haice 
Tj « 100 percent . For an unbuffered stack, and 

hence ti « 0 percent, that is no improvement. 

In the following section, a mathematical model 
is developed to eexapute the mean access time, T^. 


3.5 MATHBMATICAI FODEL 
3.5.1 Di^lCRIFPIO; 


(H+1) 

1/T^ 

L 

lAf 


We use the following notations 
: rize of '?S, 

s maximum transfer rate of IS, 

i size of FS, 

i maximum transfer rate of FS, 
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: time for an Insertion into SS, 

Tj. : time for a retrieval out of 3S, 

T : minimum time between consecutive requests. 

It is to be noted that = W T_ and = T_- 

1 s Jr ' B 

^i "^r* 

A stack is a memory with LIFO access discipline. 

The buffered stack m«iiory orgpnization can be formulated 
as a queueing model with IiIFO sei^rice discipl^ne- 
D.M.G. Wishart [1960] sttdied M/G/1 queueing system with 
LIFO service discipline and compared the results with 
those in the case of FIFO service discipline. He concludes 
that the busy period distribution, the mean queue length 
and the mean waiting time remain the same for both FIFO 
and LIFO service disciplines. However, vsrianca of 
waiting time is less in FIFO thah in LIFO, and the 
probability cf congestion is more in FIFO than in LIFO. 

A customer is congested if he arrives to find the service 
point occupied and has to wait more than the unexpeaded 
service time of the custcxner receiving service. 

We are ccmcemed with the mean waiting time, hence 
the above model is equivalent to one with FIFO access 
discipline. 

The simplest request pattern to handle mathema- 
tically is the Poissem process. This arrival process 
has been widely studied [W. Feller, 1968]. Poisson process 




71 


iapliee that the probability of k arrivals with an arrival- 
rate of Y in an arbitrary interval of time, t, is 

Prob[k arrivals in interval t] = ^ 

and the inter-arrival intervals have the exponential 
density function 

Prob[ int er-arr if al time » t] « y e ^ 


let X and p be the mean arrival rates of the 
insert ion-and ret rival -requests separately. The inter- 
arrival intervals for requests of both kinds follow 
exponential density function, that is (x+l^) 
and the probability of k arrivals of requests of both 
kinds in an arbitrary interval of time, t, is 


5 . 5.2 AKA1Y3IS 

The proposed stack memory organization (see 
Figure 3.2) can be viewed as M/H/1 queueing system, i.e., 
Poisson arrival of insertion requests (M), Exponential 
inter-retrieval time (M), and single server (1). State- 
transition-rate diagram is shown in Figure 3*3. The 
departure from the classic M/M/l model is that this model 
allows non-nearest-neighbour tiunsitions as well. Hot ion 
c 
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I»et denote the state with i items in the syataa. 
Notice that ^ transient state which resiJlts when 

FS is foxind full the first time after being aapty. This 
state corresponds to shifting out the duplicate data word 
stored in FS. virtually, Sj^ hence it is not shown 

in Figure 3*3* arrival of insertion 

request after shifting out the d\; 5 )licate data word. 

Tf < l.a<is to an aaennption of in.tant 

response from FS for both insertions and retrievals. 

T^ < 1/p also validates the assumption of instant response 
from SS for retrievals only. may be referred to as 

V 

the reorientation period. T^ denotes the reorientaticm 
period initiatai in the state Sj^.* 


Imminent insertion requests for the transiticai 
®o* during and T^ are blocked. This is accounted 
for by introducing a factor ejL 1. By the transition, 

% mean shifting out the duplicate data word 

stored in FS. a accoxmts for the blocking of the 
transiticai ^1+2 

let Pj^(t) dmaote the probability that the system 
is in the state at time t. The differential equaticais 
describing the proposed model are 

Po(t+6t) « p 6t Pj^Ct) + (l-d 6t) P^Ct) 

Pj^(t+6t) B p 6t p2(t) + el 6t P^Ct ) + (1— 16f)(l-p 6t)P2(t) 
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dt +4 6t 

+ (1- k 6t)(l-n 6t) P„(t) 

for 2 ^ n i 1-2 

Pj^_3^(t+6t) * |i 6t Pj^(t) + p 6t ^L-2^^^ 

+ (l-X6t){l-|i 6t) Pj^__^(t) 

p_(t+6t) » X6t P, T(t) + (1-aX 6t){l-M 6-t ) P^ (t ) 

M Jj— -i Jy 

Pj^^2(t46t) » aX6t Pj^{t) + (l-^i 6t) Pj ^^2 

¥e are priinarily IntereBted tn the steady state 
solatlcm. Conseouently , the set of differfflatial equations 
reduces to the following set of equilibriua (balance) 
equaticms. 

e>Po - H P;j^ 

(X +jA) at Ji Pg + e X 

(X+n) p^ » p ^n+1 ^^n-1 ' 2^ n£ Ii-2 

(i+l*) - H Pj, + tl Pj_^2 +i Pj,_2 

(« x^p) Pj^ « X Pj^^ 

* 1 + 2 " “ * H 

for p « X /|i, 

Pj_ «* e p p^ , 

^ ^n-1 ' 2iLa^J»-l 

' l44Ei 

Jlr**'* 

*l+2" “ P 




H, .r# i.4 ilAylar 
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Probability conservation relation states that 

i*0 ^ 

and hence 

L „ 

P^(l + e t r) - 1 
fffel 

For p « 1# 

^o “ lin ' 

^ l4n6L-l 

JO. c# 

“ 1+a *’l-1 ’ 

and a 

In order to determine the gating factors c and a# 
a Markov process cmtaining two absorbing states is 
developed [s. Beo, 1974]. This can be illnstrated by 
taking a specific example (see Figure 3 . 4 ) with L » 4. 

*5 and are the abeorblng etatee. Both and 

correspond to the state when PS is emp+y. The latter is 
reached on return, "^he resulting transition matrix, M, is 
given below, 

»o % % ®4 % h 

1/2 1/2 0 0 0 0 0“1 

0 0 1/2 0 0 0 1/2 I3, 

0 1/2 0 1/2 0 0 0 1 

1 - 0 0 1/2 0 1/2 0 0 b ! 

© 0 0 1/2 0 1/2 0 

0 0 0 0 0 1 0 K 

JL^ ^ ^ 0 0 0 1 ^ 
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The following recurrence relation can be used to compute 
the probability vectors. 
it(k+l) = 

where it(k) is the probability vector at the k-th instant. 
Initial probability vector can be chosen as 

ltd) « (1 0 0 0 0 0 0) . 

Subsequently! 


ii(2) « (1/2 i/2 

0 0 

0 

0 

0) 

w(3) = (1/4 1/4 

1/4 0 

0 

0 

1/4) 

ii(4) = (1/8 1/4 

1/8 1/8 

0 

0 

3/8) 

^(5) = (1/16 1/8 

3/lC 1/16 1/16 0 

1/2) 

«(6) * (1/32 1/8 

5/32 1/8 

1/32 1/32 9/16) 


and so on. 

«^{k) as Prob[PS is full and SS is requested] 

itg(k) = Prob[PS becomes empty] 

%^{k) and are the cumulative probabilities upto 

k steps. In the above example w^(6) « l/32» =s 9/16. 

Let j ) = Prob[ first passage through at 4“th instant] 

) * Prob[ first return to origin at j-th instant] 

It is to note that f^(2m) = 0 and f2(2a-l) = 0. L is 
ccsiflidered to be a multiple of two. Further* 
f3^{2»+l) » ii^(2m+l) - w^Caa-l) * 
fgC^a) » Wg(2») - iEg(2»**2)* for ®*1,2,... 



w^(k) gives the conditional probability, fhe foUoidlng 
transition probabilities are of interest in calciilating 
limiting state probability let denote the first 

passage probability corresponding to the i~th path in 
Figure 3.5« 


» Prob[ First request for transition -*• without 

returning to 3^(orlgln) during j 


h^ * Prob[ First request for insertion after the first 
return to without reaching during 

hj » Prob[ First request for insertion without returning 
to 

h^ » Prob[Firat request for transition Sj^ ^^2 without 
reaching during 


h^ a* Prob[Fij^t request for retrieval without returning 
to d^^ring 

hg = Prob[ First request for retrieval after the first 

Oi 

return to Sq without reaching during T^J 
. PrebtPirat requent for insertion during . 

It can be noticed that 


I 9 -{ 2 k-fl). Prob[(2k-fl) or more requests occui 
k*=o 

during 

I Cl- I ] 

|r«5 0 * i»0 J • , ' ; . 




stages :,- 
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* 4 y ®„{2k). Prob[(2k+l) or nsore requcFts occxir 
2 2 2 

during T^] 


2 k*o 

hty = 1/2 


I g>,{2k).[l 


2k [{X+|i)T.j^ ~(X+^)T. 


J 


Fur+hert 


ss hj = ' 

hg « h4 = ^2 • 


5.5.3 BSTIIiATIOIf OP THE GATING FACTORS 

e is considered for the transitioi - % which 
depends cm the probability of reorientation period being 
over. Reorientation period could be initiated in 3 ^ or 
^Ii+2* eatiaate £ as fcsllows 


(1 - h. 


^3 ^1+2^ 


a is involved in the transition Sj^^2 w^ich depaaids 

on the probability of reorientation-period initiated in 
Sq or ^^2 over. ¥e can write 


C|. m 

where 
«1 * 


^ “ ^4 ^lt2 ^ ®1 ^o 


ProbCtranaiticm 

*5 ( ) 


IrJ-1 


occurs] 


for fi m 1, 

“ - ^ - •’4 ' *0 ■ ^ ®o 

o*+ ^ ** 
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Solution Procedure ; 


1. Initial approximation 

e = 1 - . 

= 1/(1 + c 1 ) . 

2. X = c 


3- Compute a from 

a* + ~ ^ * 


4. 


L+2 


1+0 ^ ^o 


f 


c « 1 - 1 x 2 ~ ^3 * 

Pq » 1/(1 + e i) , 

If U-Xl 210"^ then goto 2 

f or 1 s n < I*~l 

^ 1 + 2 “ “ * 


p * e P , 
n o * 


Stop. 


3 . 5.4 MEAN ACCESS TIME 


Let T,, be the mean access time of an uabuffer^J 
stack organization consisting of entirely unidirectional 
shift-registers. 

Tu . i (S.DT^ 

PDf of the tloe for n-th request is computed from the 
n-th convolution probability density function of inter- 
request intervals. 
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(n-l)l ® * for X 


Mean Waiting Time for n-th requwt 


■0 ' 


n-1 ( 2 AT )-^ - 2 XT. 

V ^1^^“ A ji — ® ^ ^ 

•• * J*i# ^ 


S-Ft f (2AT^) -21T^ ^ 

2^1 1~ 2 J » ® ] 

jr=o 


Waiting tiae oay arise due to either (a) insertion 
requests (paths 1, 2, 3 » 4, 7) or (b) retrieval requests 
(paths 5,6) (flee Figure 3*5). 

Let Xj, ^6 Bean access tljue in the i-th path 

== 1 " I, 


1 

m t ' W - 

2 2(2n+l) 2»>-H 


^7 * “l 


As requests fcr insertion and retrieval are equi- 

probable, we can write Xg = x^* x^, Xg and Xg are the 

Bean access tiBes as a resialt of a request waiting during 
■ 0 

« Slffliiar analysis will yield th# Been access tijiee, 
naaely x^, x^, nnd Xj as a result of a request waiting 
during It is to be noted that x^ - x^, x^ ■ x^ and 

*5 • Xy H«nce» : 

f . 



83 

fABLB 3»1 Performance- jjaprovemait (ii) of buffered 
stack memory irith variable clock ECM. 


MRT (x T„) 




L 

25 

50 

75 

msm 




300 

350 

400 

2 "" 

69-1 

73.9 

78.1 

81.2 

85.4 

88,0 

89.7 

91.0 

m.9 

92.7 

4 

80.3 

83.4 

86.0 

88.0 

90.7 

92,4 

93.5 

94.3 

94.9 

95.4 

6 

85.5 

87.8 

89.7 

91.2 

93.2 

94.5 

95.3 

95.9 

96*3 

96.7 

8 

88.5 

90.3 

91.8 

93.1 

94.7 

95*7 

96.3 

96.8 

97.1 

97.4 

12 

91-9 

93.1 

94.2 

95.1 

96.3 

97.0 

97.4 

97.8 

98.0 

98.2 

16 

93.7 

94.7 

95.6 

96.2 

97.1 

97.7 

98.0 

98,3 

98.5 

98.6 

24 

95.7 

96.4 

96.9 

97.4 

98.0 

98.4 

98.7 

98.8 

99.0 

99.1 

H « 1024 

2 

66.6 

69.0 

71.5 

73.9 

78.0 

81.2 

83.6 

85.4 

86. 8 

88.0 

4 

78,7 

^.3 

81.9 

83.4 

86.0 

88,0 

89.5 

90.7 

91.6 

92.4 

6 

84.3 

85.5 

86.7 

87.8 

89.7 

91.2 

92.3 

93.2 

93.9 

94.5 

8 

87.6 

88.5 

89.4 

90.3 

91.8 

93.1 

94.0 

94.7 

95.2 

95.6 

12 

91.2 

91.9 

92.5 

93.1 

94.2 

95.1 

95.8 

96.3 

96,7 

97.0 

16 

93.2 

93.7 

94 . 2 

94.7 

95.5 

96.2 

96.7 

97.1 

97.4 

97.7 

24 

95.3 

95.7 

96.0 

96.3 

96.9 

97.4 

97.8 

98.0 

98.2 

98.4 

JS « 2048 

2 

65.3 

66.5 

67.8 

69.0 

71.5 

73.9 

76,1 

78.0 

79.7 

81.2 

4 

77.9 

78.7 

79.5 

80,3 

81.9 

83.4 

84.8 

86.0 

87.1 

88,0 

6 

83.7 

84.3 

84.9 

85.5 

86.7 

87.8 

88.8 

89.7 

90.5 

91-2 

6 

87.1 

87.6 

88.1 

88.5 

8?. 4 

90.3 

91.1 

91.8 

92.2 

95.1 

12 

90.8 

91.2 

91.5 

91.9 

92.5 

93.1 

93.7 

94,2 

94.7 

95.1 

16 

92.9 

93.2 

93.5 

93.7 

94.2 

94.7 

95.1 

95.5 

95.9 

96.2 

24 

^.1 

95.3 

95.5 

95.7 

96.0 

96.3 

96.7 

96.9 

97.2 

97.4 
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«= t 


*6 = ^4 = *2 * 


We compute the mean access time and the 
p erf onaance- improvement r, of the buffered stack as follows: 

■^b * >^1+2 + K ^1+2 


fj *s (1 - X 100 percent 


The analytical values for the perfoimance- improve- 
ment are recorded in Table 3*1 for N = 5l2f 1024» and 
2048 . 


3.6 VlfilFI CATION OF THl ABOVE MQDSB 

A simulaticm stidy was carried out in order to 
verify the analytical results. The program wan written 
in GPSS III and processed on IBM 7044. Table 3.2(a) 
and (b) compare the analytical and the simxalatian valuee 
for the mean access time, T|j,and the performance- 
improvement, ii»of the buffered stack memorlea with® « 512 
and 1024 respectively. The coofidencc-interval for the 
simulation results is computed as follows. For a fairly 
large sample of size S, the variance of the sample mmn 
ia estimated as o*/S and the distrihutim of the sample 
mean can be regatrded as normal. Given the saa^ile mean,m., 
and the sample? variance, o®, the 95 percent confidence- 
interval, c, for the population mean lies b^,twecn 
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n -1»96 o'/fS and e + 1.96 o//S« We obtain the mean access 

tiEc, T^t and the standard deviation, o, and compute ti 

and its conf idence-inte*rval usin^ the above values. To 

illustrate with an example, let T^ = 41.1, a = 118.0, 

H = 512 and S = 50C0. We can compute pcrformance-iffiprov©- 

fficnt, r\ = {l~Tv/T„) x 100 percent, that is r,= (l- )xl00 

^ (512+1) 

= 84.0 percent. The 95 percent eonfidcnce-int erval for 

T. is {T|^*-y) and (T.+y), where y = 1. 96x118. 0//5000* Hence 
e D j - T. T **Ty^ 

T) lies between ^ - l-)x 100 and ( \ ^ ) i 100. 

We can write c = y/T^ which defines the 95 percent 
confidence-interval of t). Tables 3.2 and 3*3 record the 
Values for and (h + c). In the analysis, we assume 
expcmentlal distribution of mean inter-request times? The 
analytical rcsul+s are compared with simulation results In 
Tables 3.2(a) and (b). 

For Brlangian distribution of inter-request times, 
the simulation study was carried out. Table 3*3 records 
values for T^ and q for Irlang constant E * 1,2 and 8, and 
II *» 512. It is to be noted that B = 1 means exponential 
distribution and B » oo a con'^tant distribution. 

Figure 3.6 is a plot of performance- improvement 
versus buffer-length for H » 512 and MET - lCX)Tg, 200Tg and 
400T^. Solid lines correspond to the simulation results, 
whereas dashed lines to the analytical results. Figure 3»7 
is a plot of q versus MET for I, * 4, 8 and 12. 



TABLE 

3.2(a) Buffered stack with variable clock fiCM- 
Ccwsparison of analytical and simulation 
results. H tt'512, B = 1. 
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SIMULATION M/JLY^ICAL 

SIMULATION analytical 

Ql 

, L 
) 



T) 

MfiT T 

(X T,) ^ 

T^ n±c 



25 

2 

122.5 52.4i:2.2 

T 9. 2 

69.1 

50 2 

100.5 61.0±1.9 

66.8 73.9 


4 

68.5 73.5il.7 

50,3 

80,3 

4 

56.6 7e.2±1.5 

42.4 

83.4 


8 

35.5 86.4+1.3 

29.3 

88.5 

8 

28.4 89.1+1,1 

24.8 90.3 


12 

27.5 89-5+1.2 

20.8 

91.9 

12 

28,5 91.1±0.9 

17.5 

93.1 


16 

19.8 92.6±1.0 

16.1 

93-7 

16 

15.0 94 . 1 + 0.9 13.6 

94.7 

75 

2 

84.9 67.7+1.7 

56.2 

78,1 

100 2 

73.8 71.5±1.6 

48.0 

81.2 


4 

48.4 81.3+1.4 

35.8 

86.0 

4 

41.1 84.0^1.3 

30.6 

08.0 


8 

22.3 91.5±1.0 

20.9 

91.8 

8 

20.3 92.2+1.0 17.8 93*1 


12 

17.0 93.M.9 

14.7 

94.2 

12 

14.1 94.6i0.8 12,5 

95.1 


16 

13.3 95.C^0.8 11.4 

^.5 

16 

11.7 95.8±®«® 

9.7 

96.2 

150 

2 

50.7 80.2+1.4 

37.3 

85,4 

200 2 

38.2 ^.2il.l 

30.8 

88.0 


4 

31.0 87.9+1.1 

23.7 

90,7 

4 

22.1 91.5±0.9 

19.5 

92.4 


8 

17.0 93-%0.8 13-6 

94.7 

8 

13.8 94 . 610.7 11.1 

95.7 


12 

11.7 95.8±0.7 

9.5 

96.3 

12 

9.1 96.5+0.6 

7.8 

97.0 


16 

10.0 96.1+0.6 

7,3 

97,1 

16 

5,9 97.&fc0.5 

6.0 

97.7 

300 

2 

29.4 88.7+1.0 

23.1 

91.0 

400 2 

20, 2 92.1i;0.9 

18.7 

92.7 


4 

17.5 93 . 2 + 0.8 

14.5 

94.3 

4 

13.2 95,0i0.7 

11.7 

^.4 


8 

9.5 96.5±0,6 

8.2 

96.8 

8 

6.9 97.4±0.6 

6. 6 

97.4 


12 

6.0 97.7i;0.5 

5.7 

97.8 

12 

3.0 98.9+0.3 

4.6 

98.2 


16 

5.0 98.0±0.4 

4 * # ^ 

98,3 

16 

2.9 98,9+0.3 

3.5 

98,6 

Mlf > 

Mean inter-request time, 

, % * 

Length of buffer (fast stack), 



* Meim access time, t) « Performance-improvement, 
e » Confid«i 0 e-interval. 



87 


fABLB 3- 2(b) Buffered stack iritjbi variable clock E(3M. 

C<Mipari 8 ca 3 i of analytical and sioiilation 
results. N 3 = 1024 » 1 = 1 . 



SIiroiATIOM 

AHAlYTICia 



SIMULATIOH 

ahulyticaI 

MfiT r 

(x Tg) ^ 

% n±c 


T1 

MET 

(x Tg 

) ^ 

’'b 



n 

25 2 

270.0 47.3±2.2 

171.2 

66.6 

50 

2 

245.0 

52.4±2.2 158.5 

69.0 

4 

163.7 68.2+1.9 

109.0 

78.7 


4 

138.3 

73.1±1.7 

100.7 

80. 5 

8 

79.0 86.4±1.3 

63.7 

87.6 


8 

70.9 

86 » 4 jhl • 4 

58.7 

88.$ 

12 

60-8 e8.3il.3 

45.0 

91.2 


12 

51.4 

90.1±1.1 

41.6 

91.9 

16 

48.8 90,3+1.1 

34.9 

93.2 


16 

40.4 

92.2±1.0 

32.2 

93.7 

75 2 

212.8 58.6i;2.0 145.9 

71.5 

100 

2 195.6 

62.0+1.9 133.7 

73.9 

4 

128.2 75.Q+1.6 

92.6 

81.9 


4 

108.7 

79.q±1.5 

05.9 

83.4 

8 

72, 2 85.%1.3 

54.0 

89.4 


8 

58*4 

'88# 7^1 • 2 

49.6 

90,3 

12 

60.9 88.3±1.2 

38.3 

92.5 


12 

47.0 91.1±1.0 

35.1 

93.1 

16 

32.5 93.%0.9 

29.6 

94.2 


16 

25.3 

95.210.8 

27.1 

94.7 

150 2 

165.2 67.^1. 7 

112.5 

78.0 

200 

2 149.7 

70.711.5 

96.2 

81. 2 

4 

98.2 80.9±1.5 

71.7 

86.0 


4 

78.0 

85. Oil. 3 

61.4 

88.0 

8 

45.0 91.7i:1.0 

41.8 

91.8 


8 

40.7 

92.2+0.9 

35.6 93.0 

12 

34,0 93.at0.9 

29.5 

94.2 


12 

29.8 

94.410.8 

25-1 

95.1 

16 

26.4 95.C^.8 

22.8 

95.5 


16 

23.4 

95.8±0.8 

19.3 

96.2 

300 2 

97.6 80.8^1.3 

74.8 

85.4 

400 

2 

76.4 

85.2±1*2 

61.7 

88.0 

4 

57.9 88.9±1.1 

47.5 

90.7 


4 

92.6 

89.911.0 

39.0 92.4 

8 

34.0 93.840.9 

27.3 

94.7 


8 

28.2 

94.61O.8 

22.3 

95.7 

12 

27.0 94.%0.7 

19.1 

96.3 


12 

18.2 

96. 5lO. 7 

15.6 

97.0 

16 

20.2 96.14:0.6 

14.7 

97.1 


16 

15.6 

97.1i0.6 

11.9 

97.7 

MET = Mean inter-request 

time. 

J[| * . 

Imgth 

of 

buffer (fast stack). 


= Mean 

access timet 


fj m 

Perfomance improvwent. 




c »» Confidence interval. 
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TlBLB 3-3 Effect of variation in Brlang constant (S) 
an the performance of a buffered stack with 
variable clock B®. M = 512- . 




B 

= 1 

B = 

2 


E = 0 

mi 

B 




ti+c 


TJ + C 

100 

2 

73-8 

71.5±1.6 

72.2 

71.9il.5 

70.0 

T2* 2 


4 

41.1 

04. Oil* 3 

37.6 

85.6il.2 

36.0 

86.0+1.1 


8 

20.0 

92.2il.O 

19.6 

92.3+1.0 

19.0 

92.6iO.0 


12 

14 •! 

94.6iO,8 

13.2 

95.Q±0.7 

12.6 

95.1iP.5 

200 

2 

38.2 

85.2±1.1 

36.8 

85.6il.l 

34.8 

86, ^.9 


4 

22.1 

91.5i0.9 

18.5 

92.8i0.8 

18.1 

93.0iO,6 


8 

13-8 

94 . 6i0, 7 

13.5 

94.7±0.7 

13.0 

94.9+0.5 


12 

9*1 

96.5+0.6 

8.7 

96.9±0.6 

8.0 

96.9ip.4 

500 

2 

29.4 

88.7il.O 

21-6 

91.8iO,8 

16.4 

93.%0.6 


4 

17.5 

93.2±0.8 

12. 6 

95.4i0.7 

10.6 

95.9i0.5 


8 

9.5 

96.5iO. 6 

8.4 

96.9i0.5 

6. 6 

97-7i0.3 


12 

6*0 

97.7±0.5 

6.0 

97.7i0.4 

5.7 

98.9±0.3 

400 

2 

20,2 

92.1+0.9 

14.3 

94.4i0.7 

12.4 

95.4±0.6 


4 

13 . 2 

95-0+0.7 

11.1 

95.7±p,6 

0.5 

96.9±p.5 


8 

6.9 

97.4+0.6 

4.6 

98,2i0.4 

3.1 

98.9+0.3 


12 

3.0 

98.9i0.3 

2.8 

99.0±0.3 

2.4 

99.0i0.2 


URT w Mean inter-requeet tiBtte» L » Iiength of buffer 

(fast stack) 

- Kean acceee time, r, - Performance improvement 

c *x Confidence interval* 






l<yfncinc« 




Per+wmance- Impfovemen! 



MRT,{ * Ts ) ' 

Fifttre 3.7,^ itutriP'fc& stAtK VAyaABO; 
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3,7 DISCUSSION OF RESULTS 

Table 3»1 serves as a design guide table. To 
illustrate vith an exanple* a buffer of length 8 with a 
shift -register stack of size 1024 exhibits 90.3 percent 
iffiprovement in performance over an unbuffered SR-stack 
of the same size for the mean inter-request time of lOOTg. 

We observe in Table 3*1 that n depends on the 
product (p) of mean arrivel rate, y» reorientation 

time, NT„, that is p = y NT„. We define a constant f equal 
to 10.24. P can be expressed in terms of f. p in teaoes of 
f is recorded in Table 3*4 for a few pairs of H and MRT. 

For a given L, n is identical for (N = 512 and MIT » lOOTg ) $ 
and (M * 1024 and MRT *= 2CK)T|^)a8 P=* ^f for both of these 
pairs. 


table 3.4 

p in terms of f 
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Analytical results agree well with those obtained 
from the simulation. The access time decreases exponentially 
with an increase in the size of buffer. The lower the value 
of p the higher is the value of p (refer to Figure 3*1?) • 

Consideration of Erlangian distribution of inter- 
request times shows that an increase in the value of 

fc. 

Erlang constant, E, results in better perform.ance. However, 
the increase in p is not appreciable. The confidence- 
interval shrinks with an increase in Erlang constant. 

3.8 applications 

Typical application areas of buffered stack include 
language translation;' subroutine operations; recursive 
operations; dynamic programming; and processing of object 
programs writ+en in a structured language, like ALGOL. 

3.9 CONCLUSION 

A scheme for a buffered stack memory with variable 

clock ECM (e.g., static shift-registers) is proposed. An 

addition of a small size buffer (fast stack) to a large 

size ECM improves its performance drastically. In order 

to quantify the improvement, we define the p erf o mane e- 

improvement (p) of a buffered stack over an unbuffered stack. 

A fast stack of size 12 yields the performance- improvement 

of more than 95 percent with an BOM of size 512 for 

MET >. lOOT . 

s 



CHAPTER 4 

A HOVEL TSCmilQHB POR TOTIPLE IR3ERTIOHS 
AND DELETIONS 

1 -I ' ■ 

A scheme is propossed Ixi this chapter to facilitate 
multiple insertions and deletions in an EGM. The property 
of information-mobility is exploited to perform these 
operations. j 

4.1 INTRODUCTION ‘ 

Information moves in an Electronic Cyclic Memory 
(ECM). An information-channel within the device remains ; 
fixed in CCDs and MOS shift-registers, whereas it can he 
varied in magnetic huhhles. This additional flexibility j 

facilitates operations, like global shift, detached | 

shift, exchange and delta-exchange in a Magnetic Bubble i 

Memory. These are the basic operations required to perform 
dynamic data rearrangement [C. Tung et . al. , 1975 » T.C.Chen ■ 
and G. T'ung, 1976]. i 

■ ' ' ■ i 

Mobile information bits can be routed tlirougli an 1 

t 

external buffer whose length can be varied by means of a I 

, , ■ ■ 1 

combinatorial circuit. Thus it is possible to expand or 
contract a data path. An insertion can be performed by 

* ' . • , ''v '' ■ " V'''' 

expanding the data path. Similarly, a deletion can be 
performed by contracting the data path. Garbage collection 
[P.¥. Abraham, 1968] is a special case of the deletion 
operation and .can be performed during each cycle of rotation.! 
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Hoff and Mazor [1971] propose an elementary sclieme 
for adding and deleting data in a shift-register memory. 

A single hit can he inserted within a record during a 
cycle of rotation. The deletion operation results in 
packets of vacant positions. ¥e propose a scheme which ‘ 
enables multiple insertions to he performed during a 
cycle. Multiple deletions can he performed without 
creating packets of vacant positions. 

Such a scheme will he widely applicable in non- 
numeric processing. 

4.2 DHSCRIPTIOH OF THE SCHEME 

A schematic diagram of an ECM- Organization for 
multiple insertions and deletions is shown in Figure 4.1. We 
use the terms ECM and SR interchangeably. 

(m+1) H-hit shift-registers (SRs) in parallel provide 
m-hit word together with one activity hit which is set to 1 
when meaningful information is stored in the corresponding 
location, else it is set to 0 to indicate a vacant position. 

N may range from 512 to 2048. A vsmall size buffer is 
associated with each SR. Such a buffer consists of a 
Parallel-Iii-Serial-Out (PISO) shift-register, tsnpically 
8-32 hits long. 

There is a control unit or an inexpensive micro- 
processor [S. A. Torrero, 1975] which compares m-hit word 
frem SRs with the content of the comparand register (CR) 




















To SR 



Figure 4,2 SELECTOR UNIT 

sets highest index pointing 
to non-vacant position* 
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and issues a command for insertion or deletion ff a string. 

A mask register qan be used to mask certain bitvS while 
performing comparison or any other logical operation. 

Bits from SR enter the buffer through a selector 
unit (Rigure 4.2). A selector unit is a combinatorial 
circuit to connect the SR-output to the buffer by selecting 
the highest index pointing to a non- vacant position. It 
can increment or decrement the switch index by suitably 
modifying the activity bits so as to expand • or contract 
the data path via buffer. It can selectively store in- 
formation bits into the available positions to the left 
of the currently activated switch. All the selector units 
are synchronized and perform an operation simultaneously. 
'Increment (decrement) a switch index', and 'Insert a 
string of inf oimat ion bits' are typical control commands 
which reach a selector unit. 

4.3 OPERATION 

Refer to Eigure 4.3* Switch index is initialized 
to 1/2, where 1 is the length of the buffer. Data path 
in the buffer is 1/2, ^ - 1, ..., 1. Locations +1),...,! 
are unused and store all zeros. Switch index is decremented 
by one for a deletion operation. Suppose k is the index of 
the currently activated switch, (L-k) unused left positions 
are available for consecutive insertions and k positions 
are occupied by infoimiation bits. In order to insert b 
information bits after a specified location , min (b, (L-k) ) 



I 


TO SR 



FROM SEi_E:CTOR UNIT 



FIG.4.3 STATES OF BUFFER AFTER (a ) INITIALIZATION , 

(b) DELETION OF A STRING, (c ) INSERTION OF 
A STRING. 






100 


are stored in the buffer and the switch index is set to 
the leftmost information bit in the buffer. Remaining 
information bits to be inserted (if any) xfait for the 
next cycle. Similarly min(b, k) can be delected during 
the consecutive shift periods and the remaining ones wait 
for the next cycle, b is the length of information to be 
deleted. 

At the end of the file the switch index is initiali- 
zed to L/2. If the index is greater than l/2» it is 
decremented at every occurrence of a shift clock until 
L/2 and a zero bit is stored in the unused vacant position 
of the buffer. If the index is less than L/2 » the switch is 
set to L/2 and vacant positions are added to the end of 
the file. This is a ’garbage collection’. 

Sequence of operations, namely insertion and 
deletion of a string to be perfoimed on a file can be 
categorized as; 

S : a sequence of operations in which an operation 
does not depend on previous ones. 

S^: a sequence of operations in which an operation is 

affected by the execution of previous operations. 

In the case of S^, incomplete operations are 
stored to be scanned during the next cycle. Operations to 
be performed on a file may be stored in a lookaside 


memory 
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In the case of an operation cannot be performed 
if the previous operation is yet to be completed. 

To modify a location, desired data is written into 
that location by successively storing the desired infor- 
mation bits. 

BOM, indeed, forms a bit-parallel and word-serial 
associative memory with a data transfer rate higher than 
what could be obtained from a RAM of equivalent price. 
Further, the ability of an BCM to operate over a wide 
range of transfer rates imparts a great deal of flexibi- 
lity to perform string manipeilative operation. 

4.4 DESIGN METHODOLOG-Y 

Let A and p. be the mean arrival rates of requests 
for insertion and deletion respectively. There may be a 
request for multiple insertions or deletions. In order 
to .determine the length of buffer associated with an 
ECM, we consider the uniform distribution of length of 
insertions and deletions with and as their respec- 
tive mean burst-lengths. Further, we assume that the 
location at which an insertion or a deletion operation is 
to be performed is uniformly distributed over the entire 
cycle. Hence the mean access time is equal to 
for an operation is performed to ccmpleti on within 
the same cycle. With a finite length of buffer, an 
operation may require a few more cycles. 
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A queue -will build up if the mean service time is 
greater than the mean inter-arrival time. Thus the mean 
service time becomes the design criterion. It is desirable 
that a request should be serviced within at most two cycles. 
This poses a constraint, 

1 + 2(b - 1)4. 1/2 , 

where b = max(Aj^t ~ ^ length 

of insertions and deletions is considered to be uniformly- 
distributed over an interval (1, u). In order to determine 
the mean service time, we consider the requests for inser- 
tions. (Similar analysis holds for requests for deletions, 
let Pjj. be the probability of k information bits being in 
the buffer. ¥e obtain the mean service time, 

u 1 

i = i Mg + J I . k) ^ fWg 

where, Z(A , k) is the smallest integer greater than or 
equal to — ). It is to be noted that 

Zia f k) = 1 for u i 1/2. 

A simulation study of the proposed scheme was 

carried cut with o-ur original assimiptions that the location 

and the length of insertions and deletions follow uniform 

distribution. The ratio of the mean service time to the 

cyclic time, x/NT_ is tabulated in Table 4*1 for 1 =2,4, 

s 

8 and 16; and b 1, 1.5, 2.0, 2.5,3«0 and 4.0. The..ratio 
x/NT_ is found to be same for N = 512, 1024, and so on. 
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TABLE 4»1 Mean service time in a Buffered EGM. 

Comparison of analytical resiilts with 




simulation 

results. 
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b 
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L 
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1 
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0.80 1 
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16 
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2 
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4 
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4 
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1.84 ^ 



8 
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Figure 4*4 shows the plots of T, /NT_ versus h 

D o 


forl= 2f At 8 and 16. 


Computation of x can he simplified with the 
assumption of equal state-prohahilities. ¥e obtain 

u L 




NT. 


To illustrate with an example, we consider 1 = 4 
and b = 1.5» Hence u = 1+2 (1.5-1) = 2 and 

X = i NT^ + f S ^ UT^ 

® Jl =1 k=4-Jt+l ^ ® 

= ^ ^s + 

= 0.80 NT 

s 

The corresponding simulation value of x is 0.78 NT . 

s 

The estimated value of are compared with the corres- 

■^s 

ponding simulation values in Table 4*1* They agree well. 
4.5 APPIICATIONS 

Some applications that might benefit from the 
ability of the proposed scheme to perform string-manipula- 
tive operations include interactive terminals, information 
storage and retrieval, symbol manipulation and evaluation 
of recursive functions. 
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Higher-level instructions for string-manipulation 
are most desirable. These might include, TMI (Test with 
Mask and Insert), TMD (Test with Mask and Delete) 
instructions and other instructions for easy testing and 
modification of fields within data words in the buffer. 

4.6 GOHClUSION 

A buffered ECM organization has been proposed to 
perform multiple insertions ahd deletions. An analytic 
method is suggested to determine the birff er-length for a 
given mean burst-length. It*is concluded that a buffer 
of size 16 yields the mean service time of less than 
0.75 cycle for mean burst length <.4*0. 



CHAPTER 5 


BUFFERED STACK MEMORY ¥ITH GONS-T APT CLOCK ECM 

In this chapter a new scheme is proposed for a 
buffered stack memory organization. The earlier scheme 
discussed in Chapter 3 consists of BCMs with variable 
clock rate, whereas in this scheme the clock rate of 
ECMs is considered to be constant. We refer to a dynamic 
shift-register as a constant clock ECM. Addition of a 
few registers on top of shift-register stack (SS), 
dramatically, improves the access time. Buffer consis- 
ting of these registers is periodically initialized to 
half of its capacity so as to facilitate fast insertions 
and retrievals during the recirculation period of SS. 

5.1 lEfRODUCTION 

A stack is a memory with Last-In-First Out (LIFO) 
access discipline. Use of a stack facilitates recursion, 
enables fast interrupt handling,' increases speed of 
computation by virtue of code compression which is in- 
herited from its implicit addressing [C.B. Garlon, 19635 
B.I. Organick, 1973 ] • 

In Chapter 3 » we discussed a buffered stack memory 
organization with variable clock BCMs. In this chapter, 
we propose a new scheme for a buffered stack memory 
consisting of a small size fast stack. (FS) and large 
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size ECMs with constant clock rate, e.g., dynamic shift- 
registers. ¥e use the terms ICM and SR interchangeably. 

G. Philokyprou and A. Zacharakopovilos [1968] give 
a design of a stacking register. It is called a hubble- 
up register as a data word propagates through the register. 
In our scheme, the design of a fast stack employs the 
technique of multiple insertions and deletions in an 
ECM as given in Chapter 4. With the use of selector units, 
the content of the fast stack is initialized periodically 
(loosely speaking) to half of its capacity. 

5.2 DESCRIPTION OP THE SCHEME 

A buffered stack memory consists of m N-bit 
SRs (ECMs) in parallel and a fast stack of size L. Typical 
values for N may be 512-2048 and for D 4-32> m corresponds 
to the size of a word. The clock rate to ECM is considered 
to be constant. Pigure 5.1 depicts a single bit configura- 
tion of such a buffered stack memory with constant clock 
ECMs. The fast stack (PS) consists of high speed registers. 
We refer to the upper half of PS as and the lower half 
as Associated in this scheme is a Parallel-In- Serial- 

Out (PISO) shift register , B^ of a size equal t o that of 
B-. SELECT OR-0 is the selector unit associated with B . 

P 'O 

This selects and dynamically sets the highest switch 

index pointing to a nonvacant position in B^. If B^ is 

empty its switch index-0 is set which connects it to the 

input of SR. Output of SR is an input to SE1BC'’’0R-1 



SELECTOR - 
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associated with This selector unit dynamically sets 

the highest switch index pointing to a vacant position in 
B^. If B^ is full, its switch index-0 is set which 
connects it to the input of BELECTOR-0.. Counter FSC 
indicates the c^xrrent content of PS. Counter C^ records 
distance of the tail of ic where C^ records distance of 
the head of ic from the output port of SjR. The schematic 
diagrams of the selector units are shown in Figure 5.2. 

5.3 OPERATION 

Information stored in SR recirculates via 
SElECTOR-1 and .SELECT OR-0. Buffer is initialized to half 
of its capacity periodically (loosely speaking). As the 
head of the information chunk (ic) reaches the output port 
of SR, SElECTOR-i selectively transfers the top of the ic 
through the highest switch index to the vacant position in 
B^* Transfer takes place during the consecutive shift 
periods imtil B^ is fiill. Then information flows via 
SELECTOR-0 hack to the input port of SR. If B^ is non- 
empty, the content of B^ is transferred in parallel into B^ 
and B^ is left empty and becomes available for possible 
insertions during the recirculation period. Depending on 
the occupancy of B^ SELECTOR-O seta the highest switch 
index' pointing to a nonvacant position in B^. 

A request for insertion waits if FS is fo-und full. 
Similarly a request for retrieval waits if PS is found 
empty. 
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5.4 mathematical MODEL 

In order to obtain the probability of buff er-imder- 
flow and overflow, a Markov process containing two absorbing 
states is developed. This is illustrated by a specific 
example. Refer to Figure 5.3* Let L = and E^ are 

the absorbing states. E^^ corresponds to the state when a 
request for Retrieval occurred after the fast stack (FS) 
became empty during the recirculation period. E^ corresponds 
to the state when a request for insertion occurred after 
FS became f-ull during the recirculation period. Buffer is 
initialized to E^* 


INITIALIZA'^IOH TO ST' ATE 



Figure 5*3 STATE-TRAHSITIOH DIAGRAM 
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Transition probability matrix M can be written as shown 
below: 




E 


o 




B 


2 


B, 



E. 


1 0 

i 0 

0 4 : 

0 0 

0 0 

0 0 

0 0 


0 0 

1 0 

0 i 

1 0 

0 i 

0 0 

0 0 


0 0 

0 0 

0 0 

i 0 

0 4 - 

i 0 

0 0 


0 

0 

0 

0 

0 

i 

1 


E 


E. 


E, 


B. 


Be 


Bet 7c(iia) be the probability vector at the n-th 
instant. Eollowing recurrence relation can be used to 
calculate the probability vectors. 

7i:{n+l) = ■n:(n) .M 


At the insta.nt of initialization, the probability vectoh 
is 

7i<l) = (0 0 0 1 0 0 0) 

Subsequent probability vectors are, 

%( 2 ) = (0 0 1/2 0 1/2 0 0 )' 

%(3) = (0 1/4 0 1/2 0 1/4 0 ) 

It (4 ) = ( 1/8 0 3/8 0 3/8 0 1 / 8 ) 

Tt ( 5 ) = ( 1/8 3/16 0 3/8 0 3/16 1 / 8 ) 

MS ) = ( 7/32 0 9/32 0 9/32 0 1/32) 

The first and the last entries of it(n) are identical and 
correspond to the ciuaulative probabilities until n-steps. 
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We define, 

9 ^( 11 ) = Pro^Lfirst passage througli E _2 at the n-th 

interval] 

It is to he noted that 


9i(2m) = 0 , if 

1/2 is even 

9 ^( 2 m+l) = 0 , if 

1/2 is odd 


Further, 

9^(i) =7i_^(i) --rt_^(i~2). 

5.4.1 mean ..access TIME 

There is a single source of equally likely requests 

for insertion aX'd retrieval. Generation of requests is 

stopped if a request waits in a state of buffer overflow or 

underflow. In order to achieve the specified arrival rate 

of requests (y)» "the source should generate request with an 

arrival rate y , such that v = (1 - . P 

e of uf ' e of 

Puf are the probabilities of buffer- overflow and xmd erf low 
respectively. For a buffer of length L, P^^ = Pj^ and 
P.j^ = Pq. Pj^ = '•Prob[ finding the buffer with k items]. 

In order to determine Pj^, we consider an M/M/1/1 
queueing system in which the system can hold at most a 
total of 1 items. Any further arriving item will be 
refused entry to the system and will depart immediately 
without service. Requests continue to be generated. For 
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insertion-rate of X and retrieval rate of [i, we obtain 


Of course, we also ha.ve 

Pj^=0 fork>l. 

inalysis of tlie finite storage M/M/1/1 queueing system 
is given by l.Kleinrock [l975» Queueing Systems, Vol.l, 
pp. 105 - 105 ]. In order to solve for we use probability 
conservation constraint to obtain 


P = [1 + I (1/n)^] 

° k=l 


kn-1 




1 - (X /^i) 
1 


I+l 


Por X = ^ I P^ = 


and 


k o 


for 0^ k^ 1. 


Hence we obtain tiie effective request rate as follows: 




Probability distribution function of the time for the 

n-th request to occur is computed as below, 

^n-l 


= / U 


We obtain the mean waiting time for the n-th request, 
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W, 


J (NTg-t) d H^(t) 

n -1 (y„NT„)^ 

: m il > I — fT--^exp(-Ye IJT )] 

s j_Q 0 • © s 

n (y NT 

- -^[l - i — I - . - NT^)] 


0=0 


0 i 


^Mean access time, T-j^, in a ‘buffered stack memory organi- 
zation can be computed as follows: 


T^ =2 I ^ ¥ <p^ (n) 

^ n=(L/2)4-l ^ ^ 


1 r. 


Mean access time in a stack memory consisting of entirely 

unidirectional shift-registers is denoted by T , that is 

u 

NTg/2. 


5*4.2 PBRPOfiMANCE-IMPROVEMENT (t) ) 


As in Cliapter we define the perf oormance- 

iifiprovement (t)) of a buffered stack memory {Figure 3.1(b)) 

over an ent irely^inaid ire ct ions 1 shift-register stack memory 

(Figure 3.1(c)) as follows: 

' T 

T) = (1 “ ■^)x 100 percent 
. u 

The design objective is to obtain the length of the 
buffer such that an improvement in performance of the buffered 
stack approaches that of the ideal stack memory (Figure 3.1(a) )» 
that is- 100 percent, and the price/bit that of the unbuffered 
SR-stack manory (Figure 3.1(e))* -y 
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TABUS 5.1 Performance- improvement of a tjuffered stack 
memory with constant clopk BOM. 

Analytical resi^lts. * 
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Table 5.1 records the analytical values of r] for 
N = 512, 1024 and 2048; 1 = 2, 4, 6, 8, 12 and 16; and 

MRT/T^ = 25, 50, 75, 100, 150, 200, 250, 300, 350 and 400. 

o 

MET is the mean inter-request time, that is l/(x+M-). 

5.5 VERIFICATIOE OF THE ABOVE MODEL 

In order to verify the analytical resxlLts, a 
simulation study of the proposed scheme was carried out. 
The program was written in GPSS-III. Int er- re quest - 
arrival times are considered to be exponentially distri- 
buted. Mean arrival rates for insertion and retrieval 
requests are assumed to be equal. A request is serviced 
by the buffer. A request for insertion waits if the 
buffer is found full at the instant of its arrival. 
Similarly a request for retrieval waits if the buffer 
is fo-und empty at the instant of its arrival. The buffer 
is initialised to half of its capacity after a fixed 

period of 1TT„. 

s 

A simiilation experiment runs for 5000 requests. 
Confidence intervals of the mean access time and the 
performance- improvement of the buffered sta.ck memory, are 
determined as follow.^*, let a be the standard deviation of 
the waiting time. Standard deviation for the mean waiting 
time, T-j^, can be estimated as o/ys, where S is the number 
of requests in a simulation experiment. The 95 percent 
confidence- interval for mean access time lies between 
(T^ - 1.96^1--) and T^ + 1.96 y|-), and that for the 
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performance- impro-vement lies between (r)-c) and (ri+c) 
where c = ^ | - ^ 

Tables 5.2(a) and (b) compare the analytical 
results with the simulation results for E =512 and 1D24 

t 

respectively. Each table records the values for T-j^ and 

p for L = 2, 4, 6, 8 and 12; and MRT/T„ = 25, 50,, 75, 

s 

100, 150, 200, 300 and 400. 

■ '* 

In many applications where we want to fit a known 
mathematical distribution, the gamma distribution gives 
a reasonable fit. Erlang distribution is a specific case 
of .l-he gamma distribution with an integer Erlang constant 
E. Wh^ 'Erlang conceived was the notion of decomposing 
the service time distribution into a collection of 
structured exponential distributions [l. Eleinrock, 1975]. 
Table 5-3 records the simulation values of T-j^ and r) for 
MRT/T^ = 100, 200, 300 and 400; L = 2, 4, 8 and 12; and 

O' 

E = 1, 2 and 8, B = 1 corresponds to an exponential 
distribution end B = oo to a constant distribution. It 
is observed from Table 5.3 that t) increases and c 
decreases with an increase in E. Thus the increase in E 
results in the decrease of i)oth the mean access time and 

T-. " - 

the variance. These are the expected results. 

% . 

So far we considered a request for a single 
insertion or a single ret r^val. In many applications, a 
request occurs in the form of a burst. Burst-length may 
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TABLE 5*2 Buffered stack memory wltli constant clock ECM. 

Comparison of analytical and simulation res\ilts. 

(a) N = 512, E = 1. 




SIHUIATIOE 

ANALYTICAL 



SIMULATION 

analytical 

!RT 

L 


ri±c 


N 

MET 

) ^ 

^b 

r]±o 

'^b 

u 

25 

2 

123.6 

52.0+2.2 

112.8 

55.9 

50 

2 

108.4 

97.9+2.0 

105.5 

98.8 


4 

40.0 

84.4+1.3 

48.3 

81.1 


4 

31.3 

87.9+1.1 

39.7 

84.5 


6 

17. 6 

93.4+0.8 

24.4 

90.5 


6 

11.5 

95.8+0.6 

16.5 

93.6 


8 

8.6 

96. 9+0. 6 

11.8 

95.4 


8 

4.7 

98.5+0.4 

6.5 

97.5 


12 

2.6 

99.3+0.3 

3.2 

98.7 


12 

0.2 

100. +0.1 

0.9 

99.6 

75 

2 

98.7 

61 . 8+1 . 8 

98.2 

61.6 

100 

2 

92.4 

63.9+1.9 

90.9 

64.5 


4 

26.3 

89.9+1.0 

31.2 

87.8 


4 

20.8 

92.2+0.8 

23.9 

90.7 


6 

6.1 

97.5+0.6 

10.2 

96.0 


6 

5.6 

98# I rjhO « 4* 

6.2 

97.6 


8 

2.7 

99.3+0.3 

3.1 

98.8 


8 

1.7 

99.3+0.4 

1.5 

99.4 


12 

0.2 

100. +0.1 

0.3 

99.9 


12 

0.1 

100. +0.1 

0.1 

100. 

150 

2 

73.6 

70. til. 9 

76.6 

70.1 

200 

2 

71.8 

72.3+1.7 

63.6 

75.2 


4 

16.5 

93.8+0.7 

13.7 

94.6 


4 

12.4 

95.4+0.6 

8.1 

96.8 


6 

3.1 

98. 9+0. 3 

3.5 

99.0 


6 

. 2.5 

99.3+0.6 

l^l 

99. 6 


8 

0,4 

99.8+0.1 

0.4 

99.8 


8 

0.4 

100. +0.1 

0.1 

99.9 


12 

0.0 

100. 

o.p 

100. 


12 

0.0 

100. 

0.0 

100. 

300 

2 

53.0 

79.3+1.3 

43.' 2 

83.1 

400 

2 

44.0 

82.8+1.2 

•29.8 

88.4 


4 

6.6 

97.7+0.4 

3.2 

98.7 


4 

4.6 

98.3+0.4 

1.5 

99.4 


6 

1.1 

99.7+0.2 

0.3 

99.9 


6 

0.2 

100. +0.1 

0.1 

100. 


8 

0.1 

100. +0.1 

0.0 

100. 


8 

0.1 

100. +0.1 

0.0 

100. 


12 

0.0 

100. 

0« 0 

100, 


12' 

0.0 

100. 

0.0 

100. 
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TABLE 5-2 Buffered stack memory with constant clock A'CM. 

Comparison of analytical and simulation results. 


(b) N = 1024, E = 1. 




SIMUIATIOE 

ANALYTICAL 



SIMULATION 

analytical 

RT 

X T^) 

L 

^b 

ri+c 

Tb 


MET 
(x T^) 

L 

^b 

Tj+C 

^b 

rj 

25 

2 

286.2 

44.2+2.4 

232.9 

54.5 

50 

2 

243.6 

52.6+2. 2 

225.6 

55.9 


4 

94.4 

81.7+1.5 

105.3 

79.4 


4 

79.9 

84.6+1.3 

96.6 

81.1 1 


6 

40.4 

92.2+1.8 

57.0 

88.9 


6 

36.5 

93.0+1.7 

48.9 

90.5 ! 


8 

21.9 

95.9+1.3 

29.2 

94.3 


8 

17.6 

96.7+0.6 

23.6 

95.4 i 


12 

8.0 

98.5+0.8 

9.7 

98.1 


12 

5.3 

99.1+0.3 

5.5 

98.7 


16 

3.5 

99.5+0.5 

3.1 

99.4 


16 

1.3 

99.9+0.1 

1.4 

99.7 

75 

2 

233.6 

54.5+2. 2 

218.3 

97.4 

100 

2 

220.0 

57.3+2.0 

211.0 

58.8 


4 

69.2 

86. 6+1. 2 

88.0 

82.8 


4 

60.5 

88.3+1.1 

79.3 

84.5 


6 

29.9 

94.4+1.4 

40.8 

92.0 


6 

24.0 

95.6+1.2 

33.0 

93.6 


8 

11.5 

97.7+0.8 

18.0 

96.5 


8 

8.7 

98.5+0.4 

12.9 

97.5 


12 

3.2 

99.5+0.4 

3.6 

99.3 


12 

1.1 

99.9+0.1 

1.9 

99.6 


16 

0.4 

100. -0.1 

0.4 

99.9 


16 

0.0 

100. 

0,1 

100. 

50 

2 

193.0 

62.5±1.8 

196.4 

61.6 

200 

2 

185.0 

63.9+1.9 

181.8 

64.5 


4 

51.5 

90.1+1.0 

62.5 

87.8 


4 

45.8 

91.3+0.8 

47.8 

90.7 


6 

16.8 

95.4+1.0 

20.4 

96.0 


6 

12.8 

97.7+0.8 

12.5 

97.6 


8 

5.3 

99.1+0.3 

6. 3 

98.8 


8 

3.3 

99.4+0.3 

3*1 

99.4 


12 

0.6 

99.9+0.1 

0.5 

99.9 


12 

0.4 

100. -0.1 

0.2 

100. 

00 

2 

150.0 

70.7+1.9 

153.3 

70.1 

400 

2 

147.0 

70.9+1.7 

127.1 

75.2 


4 

28.2 

94.5+0.7 

27.5 

94.6 


4. 

^ 26.7 

94.7+0. 6 

16. 2 

96.8 


6 

8.1 

98.5+0.6 

5.0 

99.0 


6 

4.4 

99. 3+0.4 

3.1 

99.4 


8 

1.3 

99.5+0.1 

0.9 

99.8 


8 

0.6 

99.9+0.1 

0.3 

99.9 


12 

0.2 

100. -0.1 

0.0 

100. 


12 

0.0 

100. 

0.0 

100. 
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TABLE 5*5 Buffered stack memory vrith. constant clock BCM. 

Effect of variation in Erlang distribution on 
performance. Simulation results. E = 512 





E=1 


B^2 

B=8 

MET 

(x Tg) 

L 

^b 

ri+c 

^b 

'n±c 

.^b 


100 

2 

92.4 

63.9+1.9 

90.6 

64.9+1.7 

73 , 6 

71.6+1.7 


4 

20.8 

92. 2+0.8 

19.0 

■ 92.6+0.8 

19.0 

92.6+0.7 


8 

2.7 

99 . 3 + 0.3 

1.5 

99 . 7 + 0.2 

1.0 

99 . 7 + 0.1 


12 

0.2 

';#■ 

100. 0+0. 3 

0.1 

lOO.C+0,1 

0.0 

100.0 

200 

2 

71.8 

72.3+1.7 

60.8 

73 . 6 + 1.3 

60.0 

76 . 8 + 1.3 


4 

12,4 

95,4+0.6 

8.9 

96.9+0.5 

5.8 

98.1+0.3 


8 

0.4 

100.0-0,1 

0.1 

100.0 

0.0 

100.0 


12 

0.0 

100.0 

0.0 

100,0 

0.0 

^^100.0 

300 

2 

53.0 

79 . 3 + 1.3 

46.2 

81. 9+1.1 

42.0 

83 . 6+1 . 0 


4 

6.6 

97.7+0.4 

3.9 

98.9+0.3 

0. 8 

100.0-0.1 


8 

0.3 

100 . 0 - 0.4 

0.0 

100. 0 

0.0 

100.0 


12 

0.0 

100.0 

0.0 

100.0 

0.0 

100.0 

400 

2 

44.0 

82.8+1.2 

34.4 

87.3+0.9 

22,4 

91 . 2 + 0.6 


4 

4.6 

98. 3+0. 4 

1.7 

99 . 3 + 0.2 

0.1 

100.0 


8 

0.1 

100.0 

0.0 

100.0 

0.0 

100.0 


12 

0.0 

100.0 

0.0 

100.0 

0.0 

100.0 


MET = Mean inter-re quesj; time* 1= Length of buffers, 
' • , (fast stack) 

T^ = Mean access time, t] = Performance improvement, 

c = Confidence interval. 
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TABLE 5*4 Effect of ■b-ursSt-leng’th. on'tiie performance of 
a buffered stack memory with, constant clock 
EGMs. Simulation results. F= 51 2. MET = lOOT^* 


MRT=100T„ MET = 100T„ 

o ^ .. S 


b 

L 

(z T^) 


b L 

(x 


1 

2 

90.8+4.5 

0.177 

3.0 2 

706. 8+18. 2 

1.40 


4 

21.0+2.1 

0.041 

4 

305.0+9.4 

0.595 


8 

1 • 4^0 • 4 

0.003 

8 

95.3±4.8 

0.186 


12 

0. 1+0.1 

0.000 

12 

40.0+2.8 

0. 080 


16 

0.0 

0.000 

16 

19. 4+0. r 

0.038 

1-5 

2 

228.4+8* 2 

0.446 

4.0 2 

1049.4+28.5 

2.05 


4 

H 

O 

0.140 

4 

460.7+12.1 

0.900 


8 

12.7+1.6 

0.025 

8 

171.5+4.3 

0.335 


12 

2. 8+0. 7 

0.005 

12 

74.1+4.0 

0.145 


16 

0. 1+0.1 

0.000 

16 

37.4+2.9 

0.073 

2.0 

2 

420.0+12.0 

0.820 

4.5 2 

1123.5+32.6 

2.19 


4 

144.1+5.9 

0. 281 

4 

539.0+15.2 

1.03 


8 

32.4+2.7 

0.063 

8 

2l 2 . 6+7 . 5 

0.410 


12 

11.6+1.4 

0.022 

12 

98,4+5.0 

0.192 


16 

3. 7+0. 7 

0.007 

16 

48.0+3.3 

0.093 

2.5 

2 

564.5+16.0 

1.100 

b = Mean burst -length 


'4 

216.7+7.8 

0.423 

MET = Mean inter-request time 


8 

56.0+3.5 

0.109 

T^ = Mean access time 



12 

22.0+2.2 

0.042 

T^c= 95 

percent confidence- 


16 

9 . 4+1 . 4 

0.018 

limits for T-j^. 
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follow a certain distribution. We conducted a simulation 

study with uniformly distributed burst -1 engt hs . A burst - 

request waits if buffer becomes either full or empty during 

the recirculation period. Probabilities of buff er- underflow 

and overflow are expected to increase with an increase in 

burst-interval. The ratio of the mean access time to the 

recirculation period, NT_ is considered a figure of merit. 

s 

Table 5.4 records T^ and T^/NT^ for L = 2, 4, 8, 12 and 

16; MET = 100 T J IT = 512; and mean burst-length, b = 1, 

s 

JL 0^ f Stud. 4- • 5 • 

5.6 DISCUSSION OF RB??ULTS! 

Table 5.1 serves as a design guide table. To 

illustrate with an example, for L = 4, and MRT = 100 T , 

s 

n = 90.7 percent for N = 512 and p = 84.5 percent for 
N = 1024. Tables 5.2(a) and (b) compare the analytical 
results with the simulation results for H = 512 and 1024. 

It is to be noted that ri depends on the product P = yNT„, 
where l/y is the mean inter-request time (MRT). P denotes 
the average number of requests occuring during NT_. We 
define a constant f equal to 10.24,p can be expressed in 
terms of f as given in Table 5.5. 

It can be verified from Tables 5.1 and 5.2 
that n for a specified value of P is same for N = 512, 

1024 and so on. The sim-ulPition values tally within the 
specified confidence interval. 
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TABLE 5.5 



MRT( 

P 

in t 

erms 

of f. 





25 

50 

75 

100 

150 

200 

300 

400 

512 

2f 

f 



if 




1024 

4f 

2f 


f 

¥ 




2048 

8f 

4f 


2f 

¥ 

f 

¥ 



For T] greater than 95 percent with U = 512 a buffer 

of length L = 6 is selected for f/2, that is MET 2,100 T . 

s 

The plots of T) versus buffer-length are shown in 
Figure 5.4 for P = 2f, f/2 and f/4. The analytical curves 
are fairly close to the corresponding simulation curves. 

For P > f/2, analytical values of t) a.re slightly less 
than the simulation values and hence yield a conservative 
estimate for L. For p <f/2 analytical values of p are 
slightly more than the cornesp ending simulation values. 

The plots of p versus MET are shown in 
Figure 5.5 for 1 = 2, 4 a.n.d 6. The <analytical curves 
are close to the simulation curves except for lower 
values of I with p < f/5 and p > 2f. 





Simulation (with N=512 ) 
Analytical 



MRT (x Ts) with N=512 






M3T. CLOCK EC 


IN / 



CM l CD 
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Effect of Brlangian distribution on the performance 
of the buffered stack memory can be observed from Table 5 •3* 
The simifL at ion- study of the buffered stack memory verifies 
that the mean access time and its variance decrease with 
an increase in the value of Erlang constant, E. 

The plots of T, /NT versus mean burst length (b) 

are shown in Figure 5* 6 for 1=2, 4, 8, 12 and 16; and 

N = 512. ¥c observe that the mean access time increases 

linearly at the rate of 0.61 FT , O. 3 O NT and 0.15 NT 

s s s 

por unit of mean burst -length for 1=2,4 and 8 beyond 
the mean burst length, b = 1, 1.5 and 2.5 respectively. 

5.7 APPII CAT IONS 

Also refer to Chapter 3* A stack facilities recur- 
sion enables fast interrupt handling, increases speed of 
computation by virtue of code compression which is inherited 
from its implicit addressing, provides one for one corres- 
pondence between source language operators and machine 
instructions, preserves code and data, conditions to 
facilitate rapid environment change [B.I. Organick, 1973> 
R.¥. Doran, 1975]. 

Application of stacks ca.n be noticed in multi- 
programming, subroutine operations, language translation, 
dynamic programming, efficient processing of object 
programs written in structural language, like AlGOl. 
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MU5, HP3000, EDF9, B5500, B6700, B7700, and 
PDP-11 use stacks. In order to reduce storage referoncesj 
these computers use t op-of-stack registers. The HP 5 OOO 
has four such registers and B77O0 has as many as 32 
[R.¥. Doran, 1975]. 

5.8 conclusion 7 

A scheme has been proposed to design a buffered 
stack memory with constant clock ECMs. Addition of small | 
size high speed registers (FS) on top of a large size 

. ( 

stack with ECMs reduces the mean access time drastically, j 

i 

B\iffered organization overcomes the asymmetric access to 

I 

shift -regist er stack by virtue of periodic initialization j 

I 

of the fast stack (FS) to half of its capacity. Improvemej 

' ' ■ fi 

in performance over an unbuffered shift-register stack is i 
considered the design criterion. ■ 

P 

I 

The design guide table is prepared to facilitate I 

the selection of a buffer-length for the given values of j 

the mean inter-request time (MET), the performance- 

improvement (r|) and the size of shift-register stack (N). j 

For instance, a fast stack of size 6 may be selected to 

achieve the performance-improvement (t)) greater than: 

95 percent with a shift-register stack of size N — 512 

for MET >. 100 T . 

s 



CHAPTER 6 


ASSOCIATIVE ECM OR&AinZATION 

In this chapter a conceptual design of an associative 
electronic cyclic memory is proposed. An associative operatii 
is performed in two phases. Search operation is carried out 
in phase 1, and follow-up operations in phase 2. m large sii; 
shift-registers together with m small size buffers form a ; 

basic cyclic memory mod'ule (BCM) which provides m-bits of a I 

* 

word in parallel and words are sequentially accessible (see { 
Pigure 6.1) size of the content addressable memory (CAM) is 
about cm e fourth of the size of the array of BCM s. Arithmetic 

I 

and logical operations are performed by a processing •unit S 

■ i: 

(PU). Addition of parallel-in-serial-out (PISO) shift- | 

registers enables one to perform string manipulative opera- | 

1 

ticms. M'ultiple search operations over a word during a i 

'[■ 

single shift-period enhances the parallelism. 1 

6.1 INTRODUCTION | 

'■ ' ' ^ . ' ■ ■' [ 

Associative operations are based on content addres- 

i 

sing rather than location addressing. The basic associative ; 

■■ ' ' ' ■ '■ f 

operation is SEARCH for a record with a key satisfying a 
specified relation wit h a given argument . Relations may be j 
'equal to* , ’less than', 'between limits', 'next higher*, 

' ' : ■ ' ' ' ■ i 


tea z imum * , and t he ir c ount erp art s 
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Follow-up operations may "be 'pick first match' , 'pick 
all matches*, 'find its address*, 'read. e. record with succes- 
sful match', 'simple or multi-write in the specified locat ion; 
and so on. A sequence of arithffi.etic operations may precede a 
WRITE operation. 

A comparison logic unit is required to perform ; 

SEARCH operations. Distribution of comparison logic over I 
memory cells gives rise to a wide spectrum of associative ■ 
memories (AM) which includes fully parallel, word -parallel i 

and bit-serial, bit -parallel and word-serial and block j 

' •! 

oriented random access memories. Fully parallel associative | 
memory is most expensive as comparison logic is associated | 
with each bit. Bit -parallel memory is less expensive than 1 

■ ' s 

word-parallel but results in excessive processing time. i 

■ ■ ' (? 

Block oriented random access memory (BORAM) appears to be j 

. " ' ■ ■ ■ , ' ' 

suitable for information storage and retrieval [B.Barhami, 

1975]. I 

Slotnick [1970] proposes a logic-per-track disk i 

memory which is suitable for applications which suffer from | 
high cost of random access memories or performance degradation 
due to frequent transfers between primary and secGndai'y | 

memories. Lee's distributed logic memory [C.Y.Lee,1963] 

■ " A; i 

has attracted attention of many researchers. It consists j 

of a control unit and a large number of identical cells, 

■■■ \ ■ ' ■ ■ . . ■ ■ ' ' 

each of which stores one character of information. Cells 
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can communicate with two adjacent neighbours and the ; 

control unit. Cells can evaluate input conditions inde- 
pendently and in parallel. Operations on variable lengthf 
data become easier in such an organization. ; 

Electronic cyclic memory (ECM) suffers from large ' ! 

access time due to sequential access. However, it has i 

advantage of very high .. data rates. The mobility of j 

■ ■ i 

information bits and the ability to vary clock rates j 

I 

facilitate string maniptilative operations. i 

[ 

The high ratio of data area to key area in a record 
enables the designer to reduce the size of CAM. Sequential 

! 

comparison of a data word stored in CAM with a set of search^ 
arguments dirring a single shift -period will improve the 
parallelism. 

6.2 DESCRIPTION OF THE vSCHEME 

Associative Electronic Cyclic Memory consists of 
an array of basic cyclic memory modules (BCMs), match 
flip-flops, routing logic unit, processing unit (PU), a 
few parallel-in-serial-out (PISO) shift -registers and a 
content addressable (bipolar) memory of a few words. 

A BCM consists of m storage planes to provide m 
bits of a word in parallel (refer to Figure 6.1). Each 
storage plane consists of a small size buffer and a large 
size ECM with variable clock rate, e.g. , static shift- 
registers (SRs). let L and N be their respective sizes. 




FLLOW-UP ^ OPERATION 
HERE 


searchJ_operation 

HERE 


FIG. 6.1 (a) BASIC CYCLIC MEMORY MODULE (BCM) 



FIG, 6.1 Cb] AN ASSPCIATIVE ECM organization 









135 


The content addressable memory is of q words where q is about 
one fotirth of the size (a) of array of BGM*s. pxq flip-flops 
store the match response for p-search arguments being compared 
consecutively during a shift-period. Processing unit (PU) 
performs arithmetic and- logical operations as required after 
a successful match. Routing logic unit facilitates data 
movement from one BOM to another. 'Copy file' operation 
requires only change in data path. PISO, shift-registers are 
useful to perform the operations, like 'insertion of a string* 
'deletion of a string' and ' garbage collection ' . Clock’ to a 
BCM can be independently controlled. It can be inhibited for 
a certain interval of time. ; 

6.3 OPERATION I 

Refer to Figure 6.1, Information within a BCM j 

' " S; 

rotates past the output port of SRs. As the beginning of 
a record is detected at the output port of SRs and CAM is 
found busy, the corresponding clock is inhibited. SEARCH 
operation is performed within CAM. At the completion of a ; 

search operation over key area, CAM selects the ready BCMs 

I 

with inhibited clocks and SEARCH operation is initiated. 

■ ■ 

As comparison within a CAM is much faster, multiple 
comparison operations can be performed during a* single 
shift-period. Match response corresponding to each search 
operation is st ored in match flip-flops associat ed with those 

' f 

BCMs. Eey area words move through the buffer. As the 



136 


beginning of a record in a BOM reaches the output end of the 
buffer, the corresponding match flip-flops are interrogated 
by the processing unit. Subsequently the clock to that B CM 
is inhibited if any follow-up operation is to be performed 
over that record. Otherwise information keeps recirculating 
with the normal clock-rate. In the case of a change of data 
path PU’transfers control to the routing logic unit and 
becomes available to execute other operations. Routing is, 
essentially, connecting the output end of buffer to the 
input of a specified BOM. PISO shift-registers are used to 
perform operations, like 'Insertion of a String', 'Beletian 
of a string' and 'garbage collection' [Om Vikas and 
V. Rajaraman, 1976]. 

A coimter is associated with every BCM. It is set 
to zero at the start of search operation. It is incremen- 
ted with a shift clock. The search operation for a given 
set of arguments is complete over the entire storage, when 
all the counters are reset to zero. 

6.4 DESIGN METHODOLOGY 

A record can be partitioned into two parts, namely 
key -area and data-area. Search is to be performed over 
key— area. Let key-area be of fixed length, say, 16 words. 

A word may be a byte or a character. Data-area is assumed 
to be of Variable length. Let its mean length be 48 
words. Hence about 16 records can be st ored in a BCM of 
1024 words. Clock-rate may vary from 100 EHz to 5 MHz, 
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that is a shift-period may vary from 200 nsec to 10 psec. 

In other words »a clock can he inhibited upto 50 normal 
shift-periods . 16 shift-periods are used up in performing 
a search operation over a key-area. The worst case occurs 
when the beginning of recozxis coincides for all BCMs. 

Three groups of BCMs can be interrogated without exceeding, 
the permissible clock delay. Hence the size of GlM shovild 
be one third of the size of array of BCMs. However, this 
event is a rare one and may be overcome by * inhibit-ahead ’ 
technique, that is clocks to some of BOM* s are put to normal 
rate when majority-coincidence event takes place. Majority- 
coincidence event can be defined as the occurrence of 3q. 
or more BGM*s showing the beginning of records. This 
technique enables one to reduce the size of CAM which is 
an expensive ingredient. We consider the ratio q/a =1/4. 

A comparison operation in a CAM is complete within 
40 nsec ., whereas a normal shift-period lasts over 200 nsec. 
This time-differential facilitates four sequential search 
operations to be performed safely. Hence p =4. 

The search response is available at the end of key- 
area. Boll ow-up operation is performed depending on the 
search response. Key-area is stored in the buffer and is 
available for the desired opeo^at ions which would have not 
been possible in the absence of such a buffer. The size of 
buffer, 1, is eqtial to the length of key area. In our 
example Ii = 16. 
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Design of PISO shift -registers "being used for 
insertion/dcletion of a string is disussed in Chapter 4 
and also in [Om Vikas and V. . Rajaraman, 1976]. 

The processing imit can "be a microprocessor. 

6.5 APPLICATIONS 

Figure 6.1 is a conceptual schematic diagram of 
an associative ECM organization "being used for non-numeric 
applications, e.g., sym"bol manipulation, and information 
storage and retrieval. _ 

6.6 CONCLUSION 

A conceptual design of an associative electronic 
cyclic memory is presented. This seems to "be a cost- 
effective memory organization for microprocess or- "based 
information storage and retrieval systems. 

The proposed scheme is suita"ble for varia"ble length 
records. Search and follow-up operations are performed 
at different points which enhances the performance and 
simplifies the logic circuitry. Insertion and deletion 
operations are performed during the same cycle in which the 
search succeeds. Gar"bage collection operation can "be 
performed as a special deletion operation. 



CHAPTER 7 


CONCLUSIOHS AED TOPICS FOR FURTHER RESEARCH 

In this final chapter, we present the significant 
results of the thesis and attempt to put them in some 
perspective with respect to their potential for practical 
applications. The research topics which are deemed to be 
of immediate importance are also mentioned. 

7.4 SUMMARY CP RESULTS 

Hew cyclic memories, like CCDs and Magnetic Bubbles 
fill the price-performance gap between semi conduct or /core 
RAM, and magnetic disk and drum. We put these memories 
in the class of Electronic • Cyclic Memory (ECM). Two 
characteristics, namely the mobility of information and the 
ability to vary clock rate make them suitable for non^ 
mrmeric processing, stack-baaed operations and microprocessor- 
based application systems. 

We propose buffered ECM organizations with various 
access disciplines, namely EIEO and IIEO. *We observe that 
the appropriate buffering of an ECM considerably enhances 
its performance. In particular, the periodic initialization 
of b-uffers with constant clock ECMs could yield even 100 
percent improvement. 

A scheme for a buffered ECM with EIEO access 
discipline is discussed in Chapter 2. Two small buffers 
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on both, of the ends of an BCM are used to cater to random 
input-output requirements. It is concluded that it is 
preferable to use a constant clock-rate. Table 2.1 serves 
as the design-guide table with p erf orma.nc e-improvement as 
a design criterion. Buffers of size 6 added to an ECM of 
size 512 exhibit an improvement in performance by more than 
95 percent for a mean inter-request-time, MET > IGO T_. 

In ^rder to compute the fractional loss of information in 
a, -buffered serial memory, a computationally convenient 
analytic method is developed for the analysis of M/G/l/L 

'N ■ 

and G/M/l/L finite queue Skinner's models. These two models 
are shown to be duals. Table 2.3 records the fractional 
loss » of inf ormation and can be used to select an optimal 
buffer-length. Por example, buffers of size 12 with a 
constant clock EGM of size 512 yield a fractional loss of ‘ 
information of less than 10 Simulation study was 
carried out to verify the above models. For the loss 
system, the analytical results agree well with the simula- 
tion results. However, in the wait system, the analytical 
values differ slightly with the corresponding simulation 
values for buffer lengths which are much larger or much 
smaller than -^Y NTg, where Y is the mean request-rate. 

Figure 2.6 shows a fast increase in the value of the 
performance-improvement (t}) with a small increase in 
the buffer-length. 
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We have proposed two buffered stack memory organi- 
zations with variable clock ECM in Chapter 3» and with 
constant clock ECM in Chapter 5 * A queueing model of the 
buffered stack memory with variable clock ECM is developed 
from first principles. Table 3*1 records the analytical 
results and can be used as a design-guide table with the 
performance-improvement as the desiga-criterion. The 
analjrtical values agree well with the simulation values 
except for very small values of buffer-length. 

The design of a buffered stack memory with constant 
clock SOM employs the novel technique for miiltiple inser- 
tions and deletions in an ECM which is discussed in 
Chapter 4. This technique facilitates multiple 
insertions and deletions by means of expanding and 
contracting the data path. 

Table 5*1 records the analj'tical values of 
performance-improvement and can be used as a design-guide 
table. The analytical results agree well with the simu- 
lation results except for the buffer lengths which are 
much larger or much smaller than k comparison of 

the buffered stack memory with variable and constant clock 
ECM is shown in Eigure 7.1. Variable clock ECM exhibits 
better performance for buffer length 1 = 2. For 1> 4> the 
perfoiroance improves faster with constant clock ECM. The 
performance-improvement of 100 percent could be achieved 
with constant clock ECM of size 512 by using a buffer of 
size 8-16. 



Performance - Improvement (t\) 
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SimiJilation study of the above schemes shows reduction 
in the mean and the variance of access time (refer to 
Tables 3*5 and 5*3)« 

Effect of variable length biorst-insertions and 
retrievals is also studied (refer tojligures 4-4 and 5»6), 

Mean access time increases linearly with an increase in 
burst-length, A for A >b, where b = (L+2)/4* It is desirable 

' i 

that an insertion or deletion operation be complete within S 

. 

at most two cycles. Hence the buffer-length is selected such - 
that l+2(b-l) — b is the mean burst-length. ; 

An associative ECSM organization is proposed in ; 

Chapter 6. The ability to vary clock-rate and the high ; 

' ' i 

ratio of data area to key area are exploited to reduce the \ 

■ ' ' ' ■: i 

size of the content addressable memory which is an expen- ! 

I 

sive ingredient in this system. j 

' 

Simulation is a very expensive design tool. We have ( 

■ I 

given analytic methods to determine buffer-length. These 

methods are easy to use in practice. 

' 

7.2 TOPICS FOR EUET’ HER RESEARCH ' | 

Emergence of electronic cyclic memories has a major 
impact on computer architecture. The ability to vary clock 
rate and the property of information-mobility in such memories 
open a new horizon for their applicoticn in various areas, 
namely microprocessor based systems, non-numeric processing. 
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and stack- oriented applications. Hence there is need to 
pursue research in this area* 

The following is the list of topics which need 
further research. 

Hardware 

* Development of a micro-controller for an ECM to 
be used in a computer system, is of paramount importance. 
VariOTis scheduling disciplines and data organizations 
can be considered. 

* A conceptual design of an asassociative ECM 
organization is discussed in the thesis. Detailed 
logical design and fabrication still remain to be done. 

Data Structure 

We feel that ECM will emerge as cost-compatible 
memory for micro-proces.sor based systems for non-numeric 
processing. The. problem of data structure in an ECM 
requires immediate attention. Due to the property of 
information-mobility and varying clock-rate, the emergence 
of more efficient data structures is postulated. 

Analysis 

* Analysis of a buffered electronic cyclic memory 
with other scheduling disciplines, like Shortest latency 
Time Eirst (SITE) and Shortest Processing Time Eirst 
(SPTE), and data organizations, like file and page will 
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be usefiil in perf ormance-eval'&ation. Fuller's analysis 
of rotating storage units [1975] may be the guiding 
material in the initifil st^gevS of the roseprch work. 

* Analytic models are needed be developed for 
bulk insertions and retrievals in FIFO and UFO access 
disciplines. 

* Analysis of the pro;^sed scheme for Diultiple 
insertions and deletions' in a shift-register will be 
useful to quantSy the performance of» micrpprocessor- 
based systems where this scheme will be widely appli- 
cable. 

* We have suggested in this thesis a computationally 

convenient method to analyse the finite queue Skinner's 
model with server-walking time. This is with FIFO access 
discipline. Development of a similar model with UFO 
access discipline will be useful to study the behaviour 
of buffered stacks. ■ 

Micro-computer Architecture 

* Kartashev [1976] proposes a design of a micro- 

• > 

computer using a shift-register memory. Buffered SR 
stack-based design of a micro- computer will bG"'%,n 
interesting project. 
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APPEEDIX A 

We have proposed the schemes for the buffered constant el oc 
•EGM organizations with FIFO access discipline in Chapter 2 and- 
with LIFO access discipline in Chapter 5 , and the buffered 
variable clock ECM organization with LIFO access discipline in 
Chapter 3* Tke Scheme for multiple insertions and deletions 
in a buffered BCM has been discussed in Chapter 4. In this 
appendix, we use the Computer Design Lang-uage (CDL) - a ' 

highly descriptive language [Y. Chu, 1972], to describe the 
above schemes. 

We use the terms SR and ECM interchangeably. 

A-1 BUFFERED SERIAL MEMORY WITH CONSTANT CLOCK ECM 

A scheme is proposed in Chapter 2 to realise a serial 

memory with constant clock ECMs. Addition of buffers on the 

% 

input and output ends cater to the random insertion and , 
retrieval requirements. A CDL-description of the proposed 
scljeme is given as below. 

Comment, Configuration of a buffered serial memory 

Register, Cl(0~9), s Tail-distance counter 

G2(0-9), : Head-distance counter 

SRC(0-ip)» • Shift-register content counter 

IBR(0-7), : Input buffer register 

OBR(0-7)» • Output buffer register 

1(0-3) fJ (0-3) 5 Switch index pointers 

F(0-2), j C^eratiosi register 

RI , ; Insertion request . flipflop 

^ R^ j Retrieval request flipflop ^^ ^ ^ 
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Array-Regist er , 

Array-shift - 
register, 

Gas-register, 

Cas-shift~regis- 

ter. 

Decoder, 

Terminal, 

Switch, 

Clock 

Comment, 

/START (ON)/ 
Comment, 

/RI *P(1) / 

/RI *F(2)/ 
Comment, 

/RR * P(l)/ 

/RR*P(2)/ 


Bl(l-8,0-7), : Input Buffer 

B2 (1-8, 0-7), i Output Buffer 

SR( 0-1025, 0-7) : Shift -registers in 

parallel 

CB = Bl(8).. .Bl(l)-B2(l). . .B2(8) 

CSRl = Bl(8):. .B1(1)-SR(0-1023) 

CSR2 = SR(0-1023)~B2(l-8) 

Z(l-4) = P 
PATHl = K(l), 

PATH2 = Z(2), 

PATH3 = K(5), 

RBCL = Z(4), 

START(ON) • 

Mr 

P(l-2) : Clocks for data transfer 

between buffers 

CIK, : Clock for shift -register 

Start operation 

SRC;«- 0, 0, 0, Cl-^ 0, C2-«-0, P-^-1 

Req^uest for insertion sets RI 
IF(I^8) THEN I ^ I 
ELSE RI-^ 0; 

Bl(l) IBR, RI 05 
Request for retrieval sets RR 
lE(J^O) THEN OBR B2(J) 

ELSE ER 0; % 

J-*- dec J, ER -^05 
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Comment, 

/E(1)*P(1)/ 

/K(1)*P(2)/ 


Comm ent , 
/K(2) * CIK/ 


Comment , 
/K(3) * CLK/ 


Comment , 
/K(4)*CLK/ 


/E(4)*Z*P(1)/ 


m 


Path! is activated when CR is empty, 
IP(J=8)'rHBIT F -^2; 

CB shr CB 
I dec I, J inc Jj 

Activate, Path2 when tail reaches input port 
IP((I=0) + (SRC=1023)) then F -^4 
ELSE CSRl^shr CSRl, 

I dec I, SRC -^inc SRC, C2-*'inc C2; 

Activate Path3 when head reaches output port 
IF(J=8) THEN F 4 ELSE 
(IE(SRC=0) THEN F -^1 ELSE 
(CSR2 shr CSR2, J -*-inc J, 

SRC dec SRC, Cl -^inc Cl))j 
Rest of information recircula+es 

SR cir SR, 

Cl inc Cl» 

C2 Inc C2» 

Z *■ 1, 

IF(C1=0) T’HEN F -*-2 ELSE 
(IF(C2=0) THEN F -^3), 

0 ; 




BNB 
















I 

J 








A-2 BUPFESED STACE MEMORY WITH VARIABLE CLOCK EGM 

A scheme, for buffer fed stack memory is p^roposed in 
Chapter 3 ‘The stack memory consists of large size static 
shift -registers and a small size fast stack. A CJ)1- 
description of the proposed scheme is given as 'below. 


C omment , 


Configuration of a buffered stack memory 
organization 


Register, 


Array-r egist er , 

Array- shift- 
register,- 

Gas-shift- 

register, 


BR(0-15), 

:|^nput buffer register j 

S(0-15), 

. ! 

: Regiv=!ter with SS 

SRC ( 0-11 ) , 

: SR-stack content counter ; 

ESC (0-3), 

: ES-contaat counter 

0(0-1), 

: Operation register 

E(O-l), 

; ES-status register 

ELAO, 

: Initial insertion indicator 

AVL, 

: SS-availability indicatox 

RI, 

•r 

: Request for insertion 

RR, 

m ' ' 

j Request for retrieval 

ES(0-7 ,0-15), 

: East stack 

SS( 0-1023, 6i^l5), 

; Shi ft -register stack 


C SS( 0-1024 ) = S-SS(0-1023), 

■ - , 


Gas-register, 
Comment , 
Switch, 


CES = ES-BR 
Control Structure 

ST ART (OR), ; Start operation 

ST OP (OR), : Stop operation 



Decoder, 

K(0-5) = G 




C(l-5) = P 



T erminal , 

SS(TOP) = SS (1025, 0-15), 

9 

9 

Top Of vSS 


PS(TOP) = PS(7,0-15), 

9 

9 

Top of PS 


PS(BOT) = PS(0, 0-15), 

9 

9 

Bottom of PS 


PIND = K(0), 

9 

Find Operation 


DONE = K(l), 

9 

9 

Operation dene 


POPUP= 2(2), 

9 

9 

Retrieve a word 


PUSH = 2(5), 

9 

9 

Insert a word 


PSMT = C(l), 

9 

PS is empty 


PSPT = C(2), 

9 

9 

PS is partially 




filled 


PSPI = C(5), 

e 

9 

PS is full 

Clock, 

CD2, 



Comment , 

Start Operation 




/STARTCON)/ PSC -^0, AVI-^1, G-^l, ffZIP^O, TRANS^ 0, 


REORG 0} 

Comment, Find a request 

/Z(0)* RR/ G "^2, 

IE(ESC=0) THEN P -,.1; 

/K(0)* RI/ G - 

IP (PSC=G) THEN P^l RISE 

(IP (PSC=8) THEN P-^ 5 SiSE Ff- 2) ; 

/E(l)/ RR -^O, RI “^0, G-^ 0; 

_ •;# 

Comment, Request for retrieval and PS is CTipty 
/E(2)*C(1)*AVL/ AVI '^0, 

IP(P1AG=1) THM (Sm-e 1) 
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v ELSE('^RANS-^ l); 
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/SKIP* OIK/ CSS shr CSS, 

SRC ^ dec SRC, 

PlAG ^ 0 , SKIP -^0; 

/TRARS* CLK/ IP(SRC=0) T'HBK GO TO ERR ELSE 

(BR(0-15) -^SSCTOP), 

SS ^shr -.-SS, 

G-^ 1, AVI ^1, TRARS^O)? 

Comment, Request for Retrieval and PS is nonempty. 

PS-v BR- 

n 

/K(2)* C(l)/ CPS shr CPS, 

PSC dec PSC, G^lj 

Comment, Request for Insertion and PS is empty. 

/K(3>C (1)*AV1/ S -^BR, 

CPS shl CPS, 

PSC ■<-inc PSC, 

A . , 'f 

PLAG -^1, G^l, REORG-<-l| 

Comment, Reorganize Shift-register stack memory 
/REORG^CLK/ IP (SRC =1024) THBR GO TO ERR ELSE 

CSS reel CSS; 

Comment, Request for insert j on. PS is found partially 
filled. BR-^ PS 

/K(5)* C(2)/ CPS^shlCPS, 

PSC -fine PSC, 

G 1; 



Comment , 
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Request for insertion. FS is found full. 
Skip on FLAG-. 

/E(3)*C(3)*I'’LAG/ CFS shl CFS, 

FLA& -^0, G 

Comment, Push into SS when FLAG is reset. BR ^ FS. 

/Z< 3 ) *G ( 3 ) *A¥L*FMG/ S ^ FS (BOB ) , 

CFS shl CPS, RBORG^l, G-<-l; 
/ERR/ STOP(OR) ^ 

END 

Comment, a special operator to shift circularly 
1024 times 

Operator CSS reel CSS 

/BEGIN/ COUNT = 1024, AVL-^ 0 

/REORG * CLK/ CSS ^ cir CSS, 

COUNT -s- dec 'COUNT 

IF ( COUNT = 0) THEN (REORG^ 0, 

1 ) 

end of operator. jit 




(Continued ) 
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FLOWCHART FOR THE BUFFEREI) S'^ACK IVIEHORY WITH 
VARIABLI-CLOCE BCM- 









« 
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A-3 A SCHEME EOR JIULTIPIE INSERTIONS AND DELETIONS IN AN ECM 

A scheme for iriwsertion and deletion of a string in 
an ECM is discussed in Chapter 4- CDL-de script ion of the 
same is given as helow. Extreme states, namely empty and 
full states, of SR are not considered. 


Comment, Configuration of buffered electronic cyclic 

memory. ¥e consider that' maximum 7 woirds 
ean.be either inserted or deleted by a single 
operation. 

Register, 0E(0,2), : Operation Register 


¥C(0,2), 
CR(0-7), 
MASE(0-7), 
LC(0-9), 

RI, 

RD, 

MATCH, 

IBS, 

Array regist er, M ( 0-7 , 0-8 ) 

Blip (1-8, 0-87, 

Shift -regist er, SR{0-1023 » 0-8) 


; Word co'unt 
; Comparand register 
: Mask Register 
5 Location counter 
5 Request for insertion 
: Request for deletion 
: Match indicator 
: Unsuccessful search over 
entire memory 
t flip- flops 

: Memory to store words to 
be inserted 
; Buffer registers 
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Gas -shift regis^t 

Decoder, 

Terminal, 

Switch, 

Clock, 


Comment , 

/start (ON)/ 
/K(0)*{BIM.RI)/ 
Comment, 
/E(1)*CLK/ 

/K(1) *X*P(1)/ 


/E(1)*Y*(P(2)/ 

Comment , 

/E(2) *RD/ 
/E(2)*RI/ 


r ' ■ 

, CSR(I) = SR(0-1023)-BUP(1-I) 

GSR(O) = SR(0-1023) 

E(0-4) = OP 

ACT IV = SR(1023,0) 

JTART(ON) 

STOP (ON) 

P(l-2), : Preqn^cfes of .p(l) and 

CIE, P (2) are more than 8 times 

that of CIE. 

Start upera.tion 

OP “^0, IMS-hO, B( ,0)^0, GO -Hi, X -hO, Y -hOj 
OP -H i, LC-^O, RI-hO, RD-h 0; 

Search for a given argument in OR 
IP(LC=1024)THBN (OP-^ 0, UNS -^1) 

ELSE X-^ Ij 

MATCH^ (SR(1023,1) © CR(0) + MASE(o)) 
*(sr(1023,2) 0 cR(i) + MIseTTJ) 

*(SR(1023,8) 0 CR(7) mIseTt) ), 

X -*-0; 

IP(MATCH = 1) THEN (OP -H 2), 

Y -H 0*, 

Op er at i on t o p er f orm 

If". 

QP -H 3, GO-hO; 

OP -^4? 
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Gomment , 
/E(3)* CIE/ 


/Z(3) *X*P(1)/ 
/Z(3) *X»P(2)/ 

C onmient » 

/Z(4) * CLZ/ 


/K(4) *Z* P(1)/ 
/E(4) *X*P(2)/ 


Comment, 
/K(5)* CLK/ 
Comment , 
/E(6)* CLZ/ 
Comment, 
/CO* CLZ/ 


/GO *Z *P(1)/ 
/GO* X *P(2)/ 


Deletion of a string 
IP(¥C=0)TH1N (GO-^ 1, OP^- 0) 

ELSE (IE (1=0) THM* (SRC 4.0, OP^ 5 ) 

ELSE (CSR(I) '^cir CSE(I), X-t-l)); 

M ' • 

BI3E(l,0)-(- Oj 

I't- 3eie, I, WC*- dec VC, X -^-O j 
Insertion of a string 
iE(VC=0) THEN OP^O 
ELSE (IE (1=8) THEN (LC-h 0, OP ^ 6) 
ELSE X -^1); # 

BUE(I) *-M(VC) 

I -t- inc I , 

VC -e dec VC, 

X >^0 ; 

Deletion extends over ne^ cycle 


IE(LG = 1023) THEN (OP -K 3, GO^ 0 ); 


Ins ^t ion extends over next cycle 

'i' ’ _ ' ■ 

IE(LC = 1031) THEN OP 4 
Information' recirculates within SB 




IE (ACTIY 0) THEN 
(IE(I>’4) THEN X-^1 ELSE 
(IE(I < 4) THEN I-e inc I)), 

CSR(I) Gir CSR(I), LC ^ inc LC 
BtJE(l,0)-t-0 
I *-dec I, X*- 0 
END 
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A-4 BUPi’ERED STACK MEMORY WITH CONSTANT CLOCK ECM 

In this section a ner scheme is proposed to realize 
a stack memory using dynamic shift -registers a-iid a .small 
size "buffer (Chapter 5 )• Information within SR recirculates 
at a fixed clock-rate. There are virtually two information 
stacks, namely, one in ES and another in SR. information 

stack in SR keeps moving and can he accessed only wlien its 
top breaches the output port of SR. At this movement PS 
is initialized to half of its capacity. The scheme is 
discussed in Chapter 5. CDlr^d ascription of the same is 
given as helow: 


Comment, 


Register, 


Configuration of buffered stack 
memory with constant clock ECM. 


Cl (0-9), 

• 

Tail distance cotxnter 

C2(0-9), 

» 

Head " distance counter 

PSC(0-3), 

• 

PS-content counter 

BR(0-7), 

• 

0 

Buffer register 

1(0-2), ) 
JO-2, ) 

» 

« 

Switch index pointers 

Q(0-2), )‘ 

P(O-l), 

w 

• 

Operation on PS 

T(0-2), 

. « 

• 

Operations on SS- 

RI, 

• 

• 

Insertion request flipflop 

RR, 

• 

» . 

Retrieval request flipflop 

X,Y, 

• 

Intermediate setting flipflops. 
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Comment, 

/KP(3) *P(1)/ 
/KPC^) *2 *P(2)/ 


Comment , 

/KF(2) * P(l)/ 
/KF(2) *Z*P(2)/ 


Comment , 

/KS(0)/ 

/¥/ 

/KS(1) *CLK/ 


Comment , 
/Z/ 


/TEAKS/ 


Store a word in ES. BE PS (TOP) 
CPS^shl CPS, 

IP(I=0) THM J^-inc J 
ELSE (I^ dec I), 

P ^1, 

Eetrieve a word from PS. PS (TOP) BE 
CPS-^shr CPS, 

IP (J = 0) THEN I-einc I 
ELSE (J^dec J ), 

P^l, X«-0; 

Test whether head of information chunk 
arrives at the output port of SE. 
IP(C2=0) THEN GOTO ¥ ELSE T 1} 

IP (01=0) THEN (IP(J=0) THEN T«r4 BISS 

GOTO TEANS) ELSE GOTO Z J 
SE<-cir SE, 

Cl-^-inc Cl, C2'5~inc C2, 

T--0; 

Initialize the fast stack 
IP(J=0) THEN 

(lP(Is:0)THPN T-s-l ELSE(I<-dec I, T‘^2) 
ELvSE GO TO TEANS J 
B0^B2, C2<e-J, 1 ^ 3 °, 



Pill B1 from SE 
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Comment f 
/i;S( 2 ) * OLK/ 

/KS(2) *X*P(1)/ 
Comment, 

/ES(3) *CLK/ 

/KS(3) *X *P(1)/ 
/ENPP/ 

Comment, 

/KS(5 ) * CLE/ 

/KS{5) *Y*P(2)/ 

Comment, 

/E:S(4) * OLE/ 

Comment, 

/OVFLO/ 


B1(I)^ SR(OUT), 

SR ^ shr SR, 

Cl in c "t31 , X-^1 i 


GOTO ¥, X^ 0; 

BO joins SR-data path 
CSR(Q) ^ cir CSR(q), 

Cl^inc Cl, 

C2 '*-inc C2, X^lj 

IP(C1=0) THEIJ GO TO ENPPj 

IP(Q=0) THEN T-e O eISE 

(IP(G2=0) THEN GOTO OVPLO ELSE T-.-5); 

Form a normal data path 

CSR(Q) —cir CSR(Q), 

Y^l? 

Q^dec Q, Y— 0, GOTO ENDFj 

Itapty SR 
SR cir SR, 

IF(J ^ 0) THEN GOTO TRANS J 


SR hecames full before BO is emptied. 
STOP (ON) 


END 
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APPEKDIZ B 


This appendix contains the listings of the simulation 
programs, namely M/G/l/I and G/M/l/L finite queue Skinner^ s 
models; Buffered Stack Memory with Yariahle Clock ECM and 
with Constant Clock ECM; and the scheme for mTiltiple 
insertions and deletions in an ECM. These programs are 
written in GPSS III (General Purpose ^sterns Simulator). 



FINITE QUEUE SkINNER^S FCDEL 


♦ SlR^^LATE 

♦ 



M/G/l/L 

QUEUEING 

SYSTEM 

WITH 

SERVER 

; RELAXATION 

TIME 


1 

1 

c 

FUNCTICN 
Q .1 

RN1,C24 
.104 .2 

.222 

.3 

.35 5 

.4 

.5C9 

. 5 

i: 

.69 I 

m 6 

.515 

.7 

1.2 

.75 

1 .38 

.8 

1.6 

.84 

1.83 

. €8 

2.12! 


2.3 

.92 

2.52 

.94 

2.81 

.95 

2.99 

.96 

3.2 

.57 

3.5 

.SB 

3.9 

.99 

4.6 

.995 

5.3 

.998 

6.2 

,999 

7 

.99 98 

8 




1 STCRAGE 6 LENGTH OF BUFFER 

* 

* TINEP =»=» 

GENERATE 16000,,,,,! 

SAVEX 25+,1 

TERMINATE I 

# 

« INITIALIZATION 

GENERATE 

SIZE2 SAVeX 11,512 SIZE CF SR 

NRT SAVEX 12,100 

lORT SAVEX 13,1 

POINT SAVEX 20,5000 

TERMINATE 


* INSERTION 

* 

GENERATE 
GATE LR 
LOGIC S 
SAVEX 
TEST NE 
NARK 

* gate SNF 

GATE SNF 
LOGIC R 
TABULATE 
4 TABLE 

ENTER 
GATE LR 
LOGIC S 
GATE LR 
LOGIC S 
serve ADVANCE 
TABULATE 
1 TABLE 

GATE SNE 
ADVANCE 
LEAVE 
TRANSFER 


X12,FN1 

3 

3 


1+,1 

X1,X20,RESLT 

4 

1,LCST WITH LOsS SYSTEM 

1 WITH WAIT SYSTEM 

3 

MP4, 64, 64, 9 
1 

1,CUMP 


1 

2 

2 

0 

1 


SI, 0,1, 18 


i»RELAX 

X13 

I 


♦ SERVE 



finite queue SKINNE'^'S mccel 


RELAX 

ICGIC R 

X 


ADVANCE 

Xll 


logic R 

2 


TRANSFER 

,OURP 

LOST 

SAVEX 

7+,l 

CUf'P 

SAVEX 

TERMINATE 

6+ , 1 

RESET 

SAVEX 

22, V2 

2 

VARIABLE 

X7*1000*100/X6 


tabulate 

4 


SAVEX 

16,V4 

4 

VARIABLE 

1000-TB4*2000/XM 


terminate 

1000 


start 

END 

10000 



FINITE CUEUE SKINNER*S NCCEl 


* SInULATE 

* G/^/l/L CEEUEING sVSTEF WITH SOURCE RELAXATION TIME 

5j( . 

1 FU^CTIO^ RN1,c2A 

C 0 .1 .10A .2 .222 .3 .355 .4 .5C9 .5 .69 

.6 .915 .7 1.2 .75 1.38 .8 1.6 .84 l.g3 -88 

.9 2.3 .92 2.52 .94 2.81 ,95 2.99 .96 3.2 .97 

.98 3.9 .99 4-6 *995 5.3 .998 6.2 .999 7 .9998 8 

1 STCRAGE 8 SIZE CF BUFFER 

* 

* TIMER 4 * 

GEhERATE 16000,,,,,! 

SAVEX 25+,l 

TERMINATE 1 

* 

^ ** INITIALIZATION 

generate l,,,l,,l 

SIZE2 SAVEX 11,512 

MRT SAVEX 12,100 MEAN INTER REQUEST TIME 

IDRT SAVEX 13,1 

POINT SAVEX 20,3000 NUMBER OF REQUESTS 

ENTER 1,¥2 

2 VARIABLE R1/2 

Terminate 

* 

* RETRIEVAL 

♦ 

Generate xi2»fni 

SEIZE 2 

TEST NE X1,X20,RESLT 

SAVEX l+,l 

GATE SNE 1,MISS 

ADVANCE 1 

LEAVE 1 

transfer ,DRCP 

MISS SAVEX 9+,| 

DRCP SAVEX 8+,l 

release 2 

TERMINATE 

RESLT SAVEX 21, VI 

1 VARIABLE X9*1000=»100/X8 

release 2 

TERMINATE 1000 

♦ 

* INSERTION 

* 

GENERATE 1 

SEIZE 1 


CM 



FINITE CUet'E SKINNER'S NOCEL 



TABULATE 

1 

1 

TABLE 

SI, 0,4, 33 


gate SNF 

1, RELAX 


ENTER 

1 


tabulate 

2 

2 

TABLE 

Sl,Q,l ,33 


ADVANCE 

X13 


TRANSFER 

,FREE 

relax 

advance 

Xll 

FREE 

RELEASE 

TERMINATE 

1 


START 

END 

iooco 



#*## ########### 


BUFFERED STACK WITH VARIABLE CLOCK ECM 


* SIMLATE 


FERFORHANCE EVALUATICN CF A STACK HIERARCHY 

STACK CCKSISTS CF A FAST STACK AND A LARGE STATIC SHIFT REGISTER 


XI IS DVKAHIC STORAGE CF FAST STACK ANC X2 OF SLOW STACK 
X3 IS VEAK INTER ARRIVAL TINE , X4 CORRESPONDS TO FLAG 
X6 = PRCB { REGUEST IS TC SS FOR RETRIEVAL ) 

XT •= PRCB f RECUEST IS TC FS FqR RETRIEVAL OR STORAGE J 

Xa = PRCB ( REcUEST IS TC SS pCR INSERTION ) 

X9 IS capacity CF FAST STACK ANC XlQ IS CAPACITY OF SLOW STACK 

Xli FOR WAIT STATE , X12 FCR RUN STATE , X13 FqR BLOCKED ST^TE 
XU = REGlEST to go FOR SERVICE , XU = ICOO TIMES THE EFFICIEN 
X17 = ICOC TINES THE F^cH-ITY UTILIZATION 
X18 = E-CCEFFICIENT CF ERLANG DISTRIBUTION 

* XIS = LCST ITERS , X20 = TRANSITION FROM N TO N+i 

♦ 


2 FUNCTICN X21,C24 


0 

0 

1000 

-lOA 

2000 

-222 

3000 

.355 

4C00 

.509 

5C00 

.69 

6C0C 

-915 

7000 

1.2 

7500 

1.38 

8000 

1.6 

8400 

1.83 

8 800 

2.12 

9C0C 

2.3 

9200 

2.52 

9400 

2.81 

9500 

2-99 

9600 

3.2 

9700 

3.5 

9800 

3.9 

99C0 

4.6 

9950 

5.3 

9980 

6.2 

9990 

1 

9998 

8 


« ««««« 


* TIMER 

GENERATE 16000,,,,,! 

SAVeVALUE 5+,l TC TELL NUMBER OF RUNS 

terminate 1 

♦ 

* ** INITIALIZATION ** 

GENERATE 

MRT SAVEX 3,1gO MEAN INTER REQUEST TIME 

SIZE! SAVEvALtE 9,8 SIZE £*= FAST ST^CK 

SIZEZ SAVEVAlLE 10,102A SIZE Cf SR STACK 

rLANG SaVEX 18,1 ERLANG CONSTANT 

POINT SAVEX 30,10000 SAMPLE POINTS 

SAVEVALLE 1,V1 FS IS HALF FILLED 

SAVEVAlLE 2,V2 SS IS HALF FILLED 

SAVEX 3,V9 

AHEAD lERM.INATE 

* 

GENERATE I 

GATE NU 3 

SEIZE 3 

SAVEX 26,X18 

SAVgX 2l,iOOO 

ERLAN TEST LE 1,X26, BEGIN 

SAVEX 21 ,V? 

VARIABLE X21’«RN2/lOOO 


7 



BUFFERED STACK FEf^CRV WITH VARIABLE CLCCK ECM 


SAVEX 

26- t1 


TRANSFER 

,erlan 


* 

:5jc t t t 5^ # 5^ ^ 5^' 5^' 4 ^ ^ ❖ 

* 

GENERATE 

FEGuEct pattern 


# # # :5s 1 3^: Tfs # 't| 3^' 3gt 

BEGIN 

SAVEX 

21 ,V8 

8 

VARIABLE 

R999-X21=»10 


ADVANCE 

X3,FN2 

f»OVE 

RELEASE 

3 


GUEUE 

1 


TEST L 

C1,90»RESLT 


GATE NU 

1 ,LCST 


SEIZE 

1 ENGAGE FAST STACK 


DEPART 

1 


TEST L 

X14 ,X30, RESIT 


SAVEX 

lA+,1 


NARK 

4 

* 

TRANSFER 

.5,INSBT,RETRV 

LOST 

SAVgX 

l9+,i 


DEPART 

terminate 

1 ' , 

■ 

INSRT 

Test l 

Xl,X9, FSFUL , 


SAVEVALUE 

1 + ,1 


TEST NE 

XI ,ltFSEMP 

CUT 

SAVEVALUE 

12+,1 RUN STATE 

FREE 

RELEASE 

1 


TABULATE 

TERMINATE 

2 

FSEMP 

SAVEVALUE 

4,1 


GATE U 

2,IPUN 


SAVEX 

13+,1 


TRANSFER 

, RECRG 

IRON 

SAVEX 

12+,1 


TRANSFER 

,pECpG 

BYPAS 

savevalle 

4,0 


SAVEX 

20+,l 


DECREMENT 

FS BY 1 AND INCREMENT FS BY 1 


TRANSFER 

,FREE 

FSFUL 

GATE U 

2, RUN : 


SAVEVALLE 

13+,1 BLCCKEO STATE 


TRANSFER 

,FLAG 

RUN 

SAVEVALUE 

12+,1 

FLAG 

TEST NE 

X4,l,BYpAS Test pUAg WHETHER BOFS IS TOSS 

RECRG 

SEIZE 

2 ^ ENGAGE SLOW STACK 


Release 

1 


TABULATE 

2 



BUFFERED ST^CK f'EHCRY felTH V^RIjSELE CLOCK ECM 

TABLE WP4,0,2,10 SERVICE TIME 

START 1COCO 

ENC 



# # # # # # # 


BUFFERED STACK f'EMCRY VilTH CCNSTANT CLOCK EC?^ 


^ simulate 

* BUFFERED EIECTPCMC CYCLIC REHORy WITF L IFO ACCESS DECIPLINE 

* 

* ■ — — ^ — — 


* PEPFORRAKCE EVALUATICN OF STACK MEMORY FIERARCHY 

* STACK CCKSISTS. OF A LARGE CYNAMIC SR ANC A FAST STACK TOP 


1 

c 

function 

0 .1 

BN2» 

.109 

C 24 

.2 

.222 

.3 

.35 5 

.9 

.5C9 

.5 .69 


.915 

.7 

1.2 

.75 

1.38 

. 8 

1.6 

.89 

1.83 

.68 2.1 

.5 

•2.3 

.92 

2.52 

,99 

2.81 

.95 

2.99 

.96 

3.2 

.97 3.5 

. 98 

3.9 

.99 

9.6 

.995 

5.3 

.998 

6.2 

.999 

7 

.9998 8 


TIMER ** 

GEKERATE leOCOtfttfl 
SAVEX 25+,T 

TERMINATE 1 

INITIALIZATICN ** 


2 

STORAGE 

8 

LENGTH OF BUFFER 



GENERATE 

Iff f 1 f 'f 

1 


SIZE! 

<AVEX 

1,1029 

SIZE CF SR 


mrt 

SAVEX 

3,100 

MgAN INTER REQUEST 

TIME 


SAVgX 

10, V2 

CURRENT content IN 

SR 

2 

VARIABLE 

Xl/2 



RLANG 

SAVEX 

18,1 

ERLANG constant 


POINT 

SAVEX 

20,2000 




SAVEX 

17tV9 



4 

VARIABLE 

Xi/X18 



HALF 

SAVEX 

5f4 




ENTER 

2tX5 



NUHI 

SAVEX 

TERMINATE 

12,2 

BURST length 



GENERATE 

PECliESTS 

PATTERN 



GENERATE 

1 

, . . ■ 



GATE LR 

3 




TEST LE 

Xl5,X20 

,RESlT 



LOGIC S 

3 




SA^^EX 

26,X18 



ERLAN 

TEST LE 

1.X26, BEGIN 



ADVANCE 

Xl7,FN2 




SAVEX 

26-, 1 




TRANSFER 

,ERLAN 



BEGIN 

advance 

0 



BURST 

SAVEX 

13, VI 

BURST LENGTH 


1 

VARIABLE 

X12=»RN1/1000+1 




buffered stack 

MEMCRY WITH CONSTANT CLOCK ECM 


TEST G 

X13,0, BURST 


TABULATE 

3 


TABLE 

X13,4,4,5 


assign 

3.Xi3 


SAVEX 

15+,1 


TABULATE 

2 

2 

TABLE 

S2fC,l,7 OCCUPANCY OF BUFFER 


KARK 

1 


LINK 

ItFIFC, INCUT 

INCUT 

TRANSFER 

.5, INSRT, CELET 

INSRT 

SAVEX 

6+ ,1 


GATE SNF 

2 


ENTER 

2 


LCCP 

3,INSRT 

PUT 

TABULATE 

1 

1 

TABLE 

MP1,64,64,5 WAITING TIME 


UNLINK 

1, INCUT, 1 

SKIP 

LOGIC R 
terminate 

3 

CELET 

SAVEX 

8+ ,1 


GATg SNE 

2 


LEAVE 

2 


LOCP 

3,DELET 

GET 

TABULATE 

1 

*■ ; ^ 

UNLINK 

1, INCUT,! 


LOGIC R 
terminate 

3 

RESLT 

SAVEX 

16 ,V 6 STACK IMPROVEMENT 

6 

VARIABLE 

1000-Te!*2000/X! 

* 

TERMINATE 

1000 


generate 

1 


GATE LR 

1 


LOGIC S 

1 

SERVE 

TEST NE 

S2,X5, RELAX 


TEST L 

S2,X5,CRAW 


ENTER 

2 


advance 

1 : 


SAVEX 

10~ tl 


test G 

Xl0,0,HEAC 


TRANSFER 

, SERVE 

CRAW 

SAVEX 

19, S2 


SAvEx 

19-,X5 


LEAVE 

2,X19 


SAVEX 

!0+,X19 


TEST LE 

X10,X1,HEAD 

RELAX 

ADVANCE 

XI 

HEAD 

* 

LOGIC R 
TERMINATE 

1 



buffered stack WITH CONSTANT CLOCK ECM 


ST^5?T 

END 


IGOCO 



# # 


MULTIPLE IKSERTICNS AND CELETICKS IN ECM 


* SIf/ULATE 

BUFFERED CYCLIC MEMCRY FCR SYMBOL PROCESSING 

* BUFFERED EcM FACILITATES MULTIPLE INSERTIONS AND DELETIONS 


1 FUNCTION RN1,C24 


0 

0 

-1 

.104 

.2 
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** TIMER ♦* 
generate 160C0 
SAVEX 19+ ,1 

TERMINATE 1 

♦ INIylALIZATICN 


2 

STCRAGE 

8 

LENGTH OF BUFFER 


GENERATE 

Iffflffl 


SIZEI 

SAVEX 

1*512 

SUE CF SR 

MRTT 

SAVEX 

3,i00 

MEAN INTER REQuEST TIME 

NUMI 

SAVEX 

5*1 

MEAN LENGTH OF InS./DEL. 


SAVEX 

10*V2 


2 

VARIABLE 

Xl/2 



ENTER 

2,V5 


5 

VARIABLE 

R2/2 


POINT 

SAVEX 

20,5000 

NUMBER OF REQUESTS 


TERMINATE 




GENERATE 

RECUESTS PATTERN 


GENERATE 

X3,FN1 



TEST L 

# CH1,1 



MARK 

8 



LINK 

1,FIFC*MAMP 

MAMP 

SAVEX 

7,V3 

LOCATION 


mark 

1 


3 

VARIABLE 

X104RN3/1000 


TEST L 

X15,X20, RESIT 

PORT . 

TEST GE 

X7,Xll,MCVE 


SAVEX 

12, X7 

LOCATION YET TQ ARRIVE 


SAVEX 

12-,Xll 



ADVANCE 

X12 

DESIRED LOCATION IS REACHED 


SAVEX 

11, X7 



TABULATE 

3 


3 

TABLE 

$2,0,4,18 



TRANSFER 

.StXPANC, 

SHORT 
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XPAND 

SAvEx 

9, VI BURST LENGTH 

1 

variable 

X5*RN2/1000+l 


Test g 

X9 tO tXPANC 


INSRT 

TEST LE 

X9,B2fPUT 


TEST G 

X1,X10»FULL 


ENTER 

2,X9 


SAVEX 

10+ ,X9 


SAVEX 

ll+»X9 


SAVEX 

15+»1 


TRANSFER 

»free 


SHORT 

SAVEX 

gtVl BURST LENGTH 


test g 

X9,0,SHGRT 

# , ■ ■ 

DELET 

TEST LE 

X9 , S2 , CUT 


TEST L 

KOfXlOtEMPTY 


LEAVE 

2,X9 


SAVEX 

10-tX9 


ADVANCE 

X9 


SAVEX 

16+ a 

# , 

free 

TABULATE 

1 

1 

TABLE 

MP8,64,128,5 


TABULATE 

2 

2 

TABLE 

MPl, 64, 128*5 

OURP 

LNLlNK 

TERMINATE 

1,MAN1P,1 

3 ^ ' ' , 

FULL 

SAVEX 

22+,l 

EHPTY 

SAVEX 

22+,1 

RESLT 

SAVEX 

16, V7 

1 

Variable 

TBinOOO/Xl 


TERMINATE 

2000 


PUI 

SAVEX 

9-,P2 


SAVEX 

10+,R2 


SAVEX 

11+,R2 


ENTER 

2,R2 


ADVANCE 

XI 


ADVANCE 

X8 


LEAVE 

2,X8 


SAVEX 

13+,1 


TRANSFER 

tINSRT 

3 ^ 

RqYE 

advance 

V6 

6 ■ 

VARIABLE 

X2/2+X1-X11 


SAVEX 

11,0 HEAD REACHES POR 


TRANSFER 

»PCRT 
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SAVEX 

9-,S? 

SAVEX 

10-,S2 

LEAVE 

2 fS2 

ADVANCE 

XI 

ENTER 

2tX8 

ADVANCE 

X8 

SAVEX 

I4*»l 

TRANSFER 

tCELET 

START 

END 

IQCCO 
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